
K-12 Education Dataset (Teaching Data Series)
HK$49,999.00 - HK$88,888.00
Approx. 60 million high-quality K-12 records.
Delivered in plain-text JSON format (sample available).
Two separate corpora: 25 million Chinese records and 29 million English records—zero overlap.
Chinese data are sourced from schools in Mainland China, Hong Kong and other Chinese-speaking regions; English data come from the United States, United Kingdom, Canada and other English-speaking regions.
Each record includes: ID, question, (some with full prompt), answer, explanation, subject, grade level, difficulty. All formulae are rendered in paired LaTeX “$…$”.
Covers every core K-12 subject—Chinese, Mathematics, English, Science, Physics, Chemistry, Biology, History, Geography, Civics, etc.
Available as separate Chinese or English datasets, or as a combined bilingual set for multilingual model training. Purchase both together and receive a 20 % discount.
K12 Dataset Overview
Approx. 60 million high-quality K-12 records.
Delivered in plain-text JSON format (sample available).
Two separate corpora: 25 million Chinese records and 29 million English records—zero overlap.
Chinese data are sourced from schools in Mainland China, Hong Kong and other Chinese-speaking regions; English data come from the United States, United Kingdom, Canada and other English-speaking regions.
Each record includes: ID, question, (some with full prompt), answer, explanation, subject, grade level, difficulty. All formulae are rendered in paired LaTeX “$…$”.
Covers every core K-12 subject—Chinese, Mathematics, English, Science, Physics, Chemistry, Biology, History, Geography, Civics, etc.
Available as separate Chinese or English datasets, or as a combined bilingual set for multilingual model training. Purchase both together and receive a 20 % discount.
Purchase Options
English Version — 29 million records (no overlap with Chinese edition)
Chinese Version — 25 million records (no overlap with English edition)
Bilingual Version (Chinese + English) — 60 million records (20 % bundle discount)
Data Details
Aspect Description
Volume ~60 million education records—ample scale for deep-learning model training.
Languages Chinese and English versions can be used independently or jointly for cross-lingual enrichment.
Subject Coverage Full K-12 curriculum across all major disciplines.
Hierarchical Structure Clearly layered taxonomy by grade, subject, question type and difficulty—highly structured for targeted filtering.
Formats Provided in JSON (and optional CSV); each entry contains ID, question, (prompt), answer, explanation, subject, grade, difficulty, etc.
Compliance Legitimate, rigorously screened sources—content is reliable and free of copyright disputes; safe for commercial or research use.
Extensibility Custom subsets or subject-specific augmentations available to match project requirements.
💳 Payments & Currency
We support multiple payment methods, including VISA, Alipay, and more (please contact customer service for other options).
All transactions are settled in Hong Kong Dollars (HKD). The system will automatically convert the amount to your local currency at the current exchange rate.
For large purchases (over HKD 100,000), please contact us for special payment options such as Corporate Alipay, Bank Transfers, or USDT.
🇭🇰 Hong Kong users: For AlipayHK, WeChat Pay HK, or FPS (Faster Payment System), please contact us for a Hong Kong-specific payment link.
🇸🇬 Singapore users: If paying via PayNow, please contact us for a Singapore-specific payment link.
📦 Delivery & Service
All products available for purchase are in stock. Upon successful payment, your order will be automatically delivered to your email.
For more information on our services and after-sales policies, please refer to our Terms of Service and Privacy Policy.