}
No problem! Add your own private dataset to the karmadata platform. We'll standardize the data and you can access it in your own private sandbox, leveraging our existing data, query engine, and visualization tools.
Eric Schmidt once estimated that the total of all human knowledge created from the dawn of man to 2003 totaled 5 exabytes. IBM says that much data is being created every 2 days now. That is a lot of data. But much of it is noise. We are focused on data that matters to people. We gather data in two ways: by acquiring entirely new data sources, and by incrementally updating each existing data source as they grow. Members currently have access to more than 400k entities, 100 million standardized records, and more than 88 gigabytes of data.
Entities are the keys that unlock the potential of disparate data sources. We standardize these entities across data sources. Even better, you can identify the relationships that exist among these entities in a fact-driven way. That means ranking clinical sites by their trials within a disease, or drugs by the number of adverse events in a country.
280k clinical investigators mined from the most reputable clinical sources: ClinicalTrials.gov. Bioresearch Monitoring Information System, NIH Grant RePORTER, Clinical Investigator Inspection List, and PubMed. Rank investigators by their clinical experience with a disease or drug among many other filters.
Track an organization’s activity across a variety of data sources and roles: pharmaceutical and biotechs sponsoring drug development, clinical sites conducting clinical research, patent owners accruing intellectual property, law firms filing patents, hospitals, pharmacies, and more.
A database of the 1.8 million populated cities in the world, 65k of which we have identified in at least one of our 34 open data sources. Latitudes and longitudes are available for mapping.
Database of more than 9k drugs built using the NLM’s ChemID database, 6.5k of which we have mined facts from open data.
Leveraging the taxonomy work of the National Library of Medicine, we are currently tracking more than 4k diseases from the MeSH hierarchy (we also standardize to ICD-9 and MedDRA). Diseases are conveniently presented as both singular entities and also grouped into like terms via the MeSH hierarchy.