InfoBayAI 's Collections

Academic Textbook Corpora for LLM Training

Sample of a 2.6+ word textbook corpus across 39K+ books, 5K+ subjects, and 15 languages for LLM training and multilingual knowledge modeling.