instruction-pretrain/instruction-synthesizer Text Generation • 7B • Updated Mar 1, 2025 • 28 • 79
TxT360: Trillion Extracted Text 📖 Explore and analyze the TxT360 dataset for LLM pre-training • Running • 132