Harvard FairVLMed with 10,000 Samples

Harvard FairVLMed with 10,000 Samples (Harvard-FairVLMed10k): This Harvard-FairVLMed10k dataset includes 10,000 samples from 10,000 patients to study the fairness issue in medical vision-language models. This dataset is used in our paper “FairCLIP: Harnessing Fairness in Vision-Language Learning" published in the 2024 Conference on Computer Vision and Pattern Recognition. The corresponding code is available on our GitHub repository FairCLIP. Here is the data download link for Harvard-FairVLMed10k. This dataset can only be used for non-commercial research purposes. At no time, the dataset shall be used for clinical decisions or patient care. The data use license is CC BY-NC-ND 4.0. If you have any questions about this dataset, please email harvardophai@gmail.com.

Note that, the modifier word “Harvard” in the dataset name “Harvard FairVLMed" only indicates that our dataset is from the Department of Ophthalmology of Harvard Medical School and does not imply an endorsement, sponsorship, or assumption of responsibility by either Harvard University or Harvard Medical School as a legal identity.

Check more Harvard Ophthalmology AI Datasets.