JayTongue
0 followers · 2 following
AI & ML interests
None yet
Recent Activity
Replied to conceptofmind's post 19 days ago
Teraflop AI is excited to support the Caselaw Access Project and the Harvard Library Innovation Lab in the release of over 6.6 million state and federal court decisions published throughout U.S. history. It is important to democratize fair access to data for the public, the legal community, and researchers.
This is a processed and cleaned version of the original CAP data. OCR errors were introduced when the texts were digitized. We post-processed each text for model training, fixing problems with encoding, normalization, repetition, redundancy, parsing, and formatting. Teraflop AI’s data engine allows for the massively parallel processing of web-scale datasets into cleaned text form.
Link to the processed dataset: https://huggingface.co/datasets/TeraflopAI/Caselaw_Access_Project
The Caselaw Access Project dataset is licensed under the CC0 License.
We plan to release trillions of commercially licensed text tokens, images, audio, videos, and other datasets spanning numerous domains and modalities over the coming months. If you are interested in contributing commercially licensed data, be sure to reach out: https://twitter.com/EnricoShippole
Follow us for the next collaborative dataset releases: https://twitter.com/TeraflopAI
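The post lists the kinds of cleanup applied to the OCR text (encoding, normalization, repetition, formatting) but does not describe how they were done. The snippet below is only a minimal sketch of what such steps can look like, using the Python standard library. The function name `clean_ocr_text` and the specific rules are assumptions made for illustration. They are not Teraflop AI's actual pipeline.

```python
import re
import unicodedata


def clean_ocr_text(text: str) -> str:
    """Illustrative OCR cleanup (hypothetical; not the pipeline from the post)."""
    # Normalization: fold compatibility characters such as ligatures into plain forms
    text = unicodedata.normalize("NFKC", text)
    # Parsing: rejoin words that were hyphenated across a line break
    text = re.sub(r"(\w)-\n(\w)", r"\1\2", text)
    # Formatting: collapse runs of spaces and tabs into a single space
    text = re.sub(r"[ \t]+", " ", text)
    # Repetition: drop a non-empty line that exactly repeats the previous line
    lines = []
    for line in text.split("\n"):
        line = line.strip()
        if lines and line and line == lines[-1]:
            continue
        lines.append(line)
    # Formatting: collapse three or more newlines into one paragraph break
    return re.sub(r"\n{3,}", "\n\n", "\n".join(lines)).strip()


if __name__ == "__main__":
    sample = "The judg-\nment  was\taffirmed.\nPage 1\nPage 1\n\n\n\nEnd"
    print(clean_ocr_text(sample))
```

Note that the hyphen rule is deliberately naive: it would also merge a genuinely hyphenated word split at a line end, so a real pipeline would likely need a dictionary check or similar safeguard.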
Updated a dataset 9 months ago: JayTongue/images_by_topic
Organizations
None yet
Models: 0
None public yet
Datasets: 1
JayTongue/images_by_topic • Viewer • Updated Nov 16, 2024 • 5.33k • 187