Open source datasets for medical AI

The Centaur Blogging Team

December 3, 2020

We live and breathe medical datasets for AI

‍

Transforming healthcare through AI hinges upon access to diverse, richly annotated, open-source datasets. Medical AI applications—from image segmentation to multimodal reasoning—are only as robust as their training data. Fortunately, recent years have yielded high-quality, freely available datasets that researchers can leverage to build, test, and deploy impactful models. We thought it would be helpful to put some of our favorite open-source datasets in an organized list and share them with the community.

‍

In our list, you can explore dozens of datasets by size, category, modality (including X-ray, ultrasound, Whole Slide Images, CT scans, ECGs), and more. Additionally, we have included a brief description that helps you to quickly understand the specific abnormalities of interest, the balance of the data, and information about annotations included, such as medical image classifications or segmentations.

‍

Our collection of open source datasets for medical

‍

Access the full collection here

‍

If you know of any datasets that should be added to this list, please let us know.

Accurate and scalable data labeling and model evaluation

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

Subscribe to our monthly newsletter

Open source datasets for medical AI

We live and breathe medical datasets for AI

Related posts

Consensus teams up with Centaur.ai to create data labels

AIMed interviews Centaur.ai CEO Erik Duhaime

How to 10X Your Data Labeling Pipeline