Toolbox for Researchers - Second Edition: A Collection of 15 Thematic Databanks
Databanks help researchers share the results of experiments and measurements, and they play an important role in shaping the academic environment and in training specialists.
This collection covers datasets obtained with expensive equipment (they usually come from large international organizations and scientific programs, most often in the natural sciences), as well as government data banks.
Photo by Jan Antonin Kolar - Unsplash
Data.gov.ru - a government open data project that will be familiar to Habr readers. Its Moscow counterpart is Data.mos.ru. Among the foreign options, Data.gov is worth noting: a platform with open data from the US government (a single catalog with filters).
University Information System - an MSU project that combines databases of statistics on the country's social and economic situation with publications from government and scientific sources. The data come both from Rosstat and from studies conducted at Moscow State University. The resource can be used without registration, but full access requires submitting an application.
Cartographic database of the Karpinsky All-Russian Geological Institute. Information on the country's natural resources, collected over the institution's entire existence, has been plotted on digital maps. The site's interface lets you overlay OpenStreetMap or Yandex.Maps with a number of additional layers showing the magnetic field, mineral deposits, and so on.
GEOSS - a portal for searching Earth observation data from satellites and drones of various types. The archive is compiled by 90 organizations around the world. To find what you need, select the area of interest on the map or enter keywords in the search box.
MAST is an archive funded by NASA. Its data are collected by orbiting telescopes, and you can browse and download them using a search with filters.
Photo by Max Bender - Unsplash
OpenEI - a platform for finding open data on energy use, in particular on renewable energy resources and new technologies in the industry. The site works on the wiki principle: the accuracy of the data is checked by the community.
Experimental Nuclear Reaction Data (EXFOR) is a library containing data from 22,615 experiments with elementary particles. Together with the CINDA (Computer Index of Nuclear Reaction Data) and IBANDL (Ion Beam Analysis Nuclear Data Library) databases, it is one of the largest nuclear physics data banks. It is maintained by Brookhaven National Laboratory in the USA but contains experiments from around the world, including Russia and China.
National Centers for Environmental Information - an archive of environmental data. It gives access to twenty petabytes of ocean and geophysical data, as well as information about the atmosphere and coastal zones. In particular, it holds data on ocean depths, the surface of the Sun, sedimentary rock records, and satellite images. You can use the catalog to search for the dataset you need.
ADS - a repository of archaeological data managed by the University of York. It holds old and new scientific publications as well as information about excavations and artifacts. Search is split into three categories: ArchSearch, Archives, and Library. The first stores data on excavations and artifacts, the second is an archive of all uploaded materials, and the third contains publications from journals, books, and studies. You can search by country, era, and type of object.
DRYAD - a service for finding research material in a databank of 80 thousand files. Research and articles from the bank can be used under the CC0 license. The materials span various fields of knowledge, but most of the research relates to medicine and computer science. According to internal statistics, in 2018 site users were most interested in whale songs, the temperature tolerance of marine inhabitants, and neural activity in the temporal lobe of the human brain.
In the “Advanced Nanomaterials and Optoelectronic Devices” laboratory, ITMO University
GenBank is a library of DNA sequences maintained by the US National Center for Biotechnology Information (NCBI) together with European and Japanese data banks. You can search by identifier in a dedicated search engine, with the BLAST tool, or programmatically.
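As a rough illustration of the programmatic route, the sketch below assumes NCBI's public E-utilities `efetch` endpoint and its `nuccore` database. The helper names (`build_efetch_url`, `parse_fasta`, `fetch_sequence`) are made up for this example, and heavy use may be subject to NCBI's rate limits.

```python
from typing import Tuple
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: the public NCBI E-utilities efetch endpoint.
EFETCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def build_efetch_url(accession: str) -> str:
    """Build a URL that asks for one nucleotide record in FASTA format."""
    params = {
        "db": "nuccore",      # nucleotide database
        "id": accession,      # accession number of the record
        "rettype": "fasta",   # return the sequence as FASTA
        "retmode": "text",    # plain text rather than XML
    }
    return f"{EFETCH_URL}?{urlencode(params)}"


def parse_fasta(text: str) -> Tuple[str, str]:
    """Split a single-record FASTA string into (header, sequence)."""
    lines = [line.strip() for line in text.strip().splitlines() if line.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("not a FASTA record")
    # The header line starts with '>'; the remaining lines hold the sequence.
    return lines[0][1:], "".join(lines[1:])


def fetch_sequence(accession: str) -> Tuple[str, str]:
    """Download one record from NCBI (requires network access)."""
    with urlopen(build_efetch_url(accession), timeout=30) as response:
        return parse_fasta(response.read().decode("utf-8"))
```

Calling `fetch_sequence` with any valid accession number should return the record's header line and its sequence as one string; the URL builder and the parser can be tried offline.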
PubChem - a database of compounds and bioassays maintained by the US National Center for Biotechnology Information. It has a web interface with advanced search (an example about the side effects of water). The data is released into the public domain.
Protein Data Bank (RCSB PDB) is a bank of three-dimensional structures of proteins and nucleic acids whose history dates back to 1971. It began as an internal project of Brookhaven National Laboratory but later grew into the largest international database of its type. Most academic journals in biochemistry require authors to deposit the protein models obtained in their research on the site.
InterPro - a database that combines many datasets from various scientific projects. It includes SMART, a program for analyzing domains in protein sequences that is based on machine learning and a dataset of 1200 models. It is supported by the European Bioinformatics Institute.
Photo tours of ITMO University laboratories:
- A look inside the "Advanced Nanomaterials and Optoelectronic Devices" laboratory
- What ITMO University's quantum materials lab does
- Mechanized arms and manipulators - what the robotics laboratory does
- Tour of our cyberphysical laboratory
- ITMO University FabLab: a DIY coworking space for creative people - a look at what's inside