
AI Training Dataset (Science Series) | Advanced Scientific Data | Large-Scale Model Training Corpus
HK$49,999.00 - HK$88,888.00
Contains about 100 million high-quality science records.
Delivered entirely in plain-text JSON format (see sample below).
Collected from multiple major online platforms and academic channels.
Of these, the Chinese edition has 400 billion entries, while the English edition has 5,000.
The Chinese and English datasets have zero overlap; each is sourced from different channels. Chinese data come from Mainland China, Hong Kong and other Chinese-speaking regions, while English data are gathered from the United States, United Kingdom, Canada and other English-speaking regions.
Every record includes: ID, title, (some with full problem statement), answer, step-by-step solution, subject, sub-discipline, grade category, difficulty, etc. (Formulae are rendered in paired LaTeX “$…$”.)
Covers all core scientific disciplines at senior-high, undergraduate, master’s and doctoral levels. Available in separate Chinese and English versions or as a combined bilingual set for multilingual model training. Purchase both together and enjoy a 10 % discount.
Science Dataset Overview
Contains about 100 million high-quality science records.
Delivered entirely in plain-text JSON format (see sample below).
Collected from multiple major online platforms and academic channels.
Of these, the Chinese edition has 400 billion entries, while the English edition has 5,000.
The Chinese and English datasets have zero overlap; each is sourced from different channels. Chinese data come from Mainland China, Hong Kong and other Chinese-speaking regions, while English data are gathered from the United States, United Kingdom, Canada and other English-speaking regions.
Every record includes: ID, title, (some with full problem statement), answer, step-by-step solution, subject, sub-discipline, grade category, difficulty, etc. (Formulae are rendered in paired LaTeX “$…$”.)
Covers all core scientific disciplines at senior-high, undergraduate, master’s and doctoral levels. Available in separate Chinese and English versions or as a combined bilingual set for multilingual model training. Purchase both together and enjoy a 10 % discount.
Purchase Options
English Version — 50 million records (0 overlap with Chinese edition)
Chinese Version — 40 million records (0 overlap with English edition)
Chinese + English Version — 100 million records
💳 Payments & Currency
We support multiple payment methods, including VISA, Alipay, and more (please contact customer service for other options).
All transactions are settled in Hong Kong Dollars (HKD). The system will automatically convert the amount to your local currency at the current exchange rate.
For large purchases (over HKD 100,000), please contact us for special payment options such as Corporate Alipay, Bank Transfers, or USDT.
🇭🇰 Hong Kong users: For AlipayHK, WeChat Pay HK, or FPS (Faster Payment System), please contact us for a Hong Kong-specific payment link.
🇸🇬 Singapore users: If paying via PayNow, please contact us for a Singapore-specific payment link.📦 Delivery & Service
All products available for purchase are in stock. Upon successful payment, your order will be automatically delivered to your email.
For more information on our services and after-sales policies, please refer to our Terms of Service and Privacy Policy.