Ministry of Electronics and IT is pushing to complete deployment of 50 artificial intelligence curation units across government departments, aiming to feed its national AI data platform with quality datasets.
These units will sift through non-personal government data from healthcare, agriculture, logistics, and geospatial sectors, then integrate cleaned datasets into AI Kosh—the IndiaAI Datasets Platform. The goal is to have a centralized, usable data for AI model training and analytics..
The curation effort addresses a core AI development bottleneck—access to clean, relevant datasets. By standardizing fragmented government data, MeitY wants to unlock sovereign AI capabilities without dependence on foreign datasets.
The effort falls under the broader IndiaAI Mission, which combines compute infrastructure buildout, dataset curation, and indigenous model development. As global competition around AI data intensifies, India is leveraging its vast public sector data reserves to build domestic AI muscle.
AIKosh aims to become the default training ground for Indian AI applications, providing structured, verified datasets that cover demographics, environment, logistics, and more—all curated through this expanding network of ministry-level AI units.

