Microsoft creates COVID-19 open research dataset, including more than 29,000 academic papers

Microsoft recently announced the creation of the COVID-19 Open Research Dataset (CORD-19), which brings together scientific articles on the novel coronavirus for research purposes. The dataset was built in collaboration with the National Library of Medicine (NLM), the Allen Institute for Artificial Intelligence, Georgetown University, the Chan Zuckerberg Initiative, Kaggle (a data modeling and data analysis competition platform), and the White House Office of Science and Technology Policy (OSTP).

The CORD-19 dataset contains more than 29,000 academic articles on COVID-19 and the coronavirus family, 13,000 of them in full text. The articles in the dataset are machine-readable, so researchers can access, analyze, and mine them with AI tools to drive algorithm development and outbreak-related research.

Microsoft has indexed and mapped thousands of articles from around the world, and has pledged to keep updating the index so that the global research community has a unified, continuously updated resource that brings together as many COVID-19 studies as possible. Microsoft hopes this will make research and discovery more efficient and speed up the response to the epidemic.