Government datasets available to the public:
Bureau of National Statistics (stat.gov.kz):
This is the official platform for obtaining data on demographic, economic, and social indicators of Kazakhstan. The portal provides detailed statistics on various sectors of the economy and society, including information on the population, economy, industry, and much more.
Open Data (EGOV) (data.egov.kz):
This is the platform for Kazakhstan's government open data, providing access to diverse datasets published by government agencies. On this portal, you can find information on transport, environment, healthcare, education, and other areas.
Smart Data Ukimet (SDU) (nitec.kz/ru/proekty/smart-data-ukimet):
This project focuses on the use of data and analytics to improve public administration processes in Kazakhstan. The Smart Data Ukimet platform enables the collection and analysis of large volumes of data to make more informed decisions in government bodies.
Introduction to open-source datasets for AI training and development:
Hugging Face:
This is one of the leading open platforms for developing artificial intelligence and machine learning. Hugging Face provides tools for natural language processing, computer vision, and other tasks, as well as access to pre-trained models, datasets, and libraries for creating advanced AI applications.
GitHub:
GitHub hosts repositories with numerous useful datasets and information on artificial intelligence. In these repositories, you can find code, documentation, and data that help developers and researchers accelerate the creation of machine learning, natural language processing, and computer vision models.
Kaggle:
Kaggle is a popular platform for data analysis and machine learning competitions. It offers an extensive collection of datasets and interactive Jupyter Notebooks that allow you to write and execute code for data analysis and model development directly in the browser. The platform also supports an active community for knowledge sharing and collaboration on AI projects.
Partner Data and Directions for AI Projects:
Open Datasets from Partners by Directions:
Collecting and publishing open datasets provided by partners, organized by their key areas of activity. Each dataset is accompanied by a brief description, allowing users to quickly find the data they need for their AI projects. The focus of the datasets is determined by the priority areas of the technology park, including technological fields such as Geographic Information Systems (GIS), blockchain, hardware technologies, and others.
Searching for and Establishing Partnerships for Providing Datasets:
We are actively seeking partners from both the public and private sectors who are willing to contribute to the development of the AI ecosystem in Kazakhstan by providing their data. Collaborating with us offers unique opportunities for companies: your data will become the foundation for innovative projects, gain recognition in the AI community, and be used within the technology park’s programs to create advanced solutions in priority technological areas. Join us to shape the future of technology together and unlock the potential of data for new achievements.