Skip to content
llm-datasets: Documentation
Framework overview
Initializing search
GitHub
llm-datasets: Documentation
GitHub
Home
Getting started
Framework overview
Framework overview
Table of contents
Data schema
Pipeline
Available datasets
Config files
Extract text data
Adding your own data
Compose training and validation dataset
Integration with other frameworks
Related work
API reference
API reference
BaseDataset
HFDataset
JSONLDataset
Config
Table of contents
Data schema
Pipeline
Framework Overview
Data schema
Pipeline