Dataset Containing Federal Criminal Charge Labels And Reference Data

I am looking for a list of federal charges that I can use as reference data when extracting mentions of said charges from unstructured text. For example, such a list would include things like:

Possession with Intent to Distribute 50 Grams or More of Methamphetamine Possession with Intent to Distribute 28 Grams or More of a Mixture or Substance Containing Cocaine Base Possession with Intent to Distribute Cocaine Possession with Intent to Distribute Heroin

I know I can get text extracts of US Code – but what I am looking for is how I could detect something like “Possession with Intent to Distribute 50 Grams or More of Methamphetamine” in freeform text and then ideally crosswalk over to a reference in USC. (example50%20grams%20or%20more%20of%20methamphetamine%2C%20its%20salts%2C%20isomers%2C%20and%20salts%20of%20its%20isomers%20or%20500%20grams%20or%20more%20of%20a%20mixture%20or%20substance%20containing%20a%20detectable%20amount%20of%20methamphetamine%2C%20its%20salts%2C%20isomers%2C%20or%20salts%20of%20its%20isomers%3B)).

submitted by /u/thegrif
[link] [comments]

Leave a Reply

Your email address will not be published. Required fields are marked *