Data
You will be analysing the Antiretroviral Therapy in HIV dataset from HealthGym.ai.
The Health Gym is a growing collection of highly realistic and freely accessible synthetic medical datasets, developed to allow users to prototype, evaluate, and compare machine learning algorithms.
The Antiretroviral Therapy in HIV dataset comprises viral loads, CD4 counts, and drug regimen information for 8,916 patients with HIV.
You can learn more about the dataset at healthgym.ai/antiviral-hiv and read about the data generation process in this paper.
Data versions and download links
Please note that there are two versions of the Antiretroviral Therapy in HIV dataset. For the datathon we will be using Version 2. The Health Gym v2.0 Synthetic Antiretroviral Therapy (ART) for HIV Dataset can be downloaded here (figshare.com).
Version 1 of the dataset is currently posted on HealthGym.ai. The variable names and data structure are similar, so if you have done some initial exploration using Version 1 you can easily transfer to Version 2.
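Once you have downloaded the Version 2 file, a quick sanity check can confirm that it loaded as expected. The sketch below is only an illustration and assumes the download is a CSV file. The function name summarise_dataset, the file name, and the patient identifier column name are all hypothetical, so check the actual file for the real names. It uses only the Python standard library.

```python
import csv


def summarise_dataset(csv_path, id_column=None):
    """Return the row count, column names and (optionally) the number of
    distinct patients in a CSV file.

    csv_path and id_column are placeholders: substitute the real file
    name and the real patient identifier column from the download.
    """
    with open(csv_path, newline="") as handle:
        reader = csv.DictReader(handle)
        columns = list(reader.fieldnames or [])
        has_id = id_column in columns
        n_rows = 0
        patient_ids = set()
        for row in reader:
            n_rows += 1
            if has_id:
                patient_ids.add(row[id_column])

    summary = {"n_rows": n_rows, "columns": columns}
    if has_id:
        # Distinct identifier values, i.e. the number of patients
        summary["n_patients"] = len(patient_ids)
    return summary


# Example call (both names are assumptions, not the real ones):
# print(summarise_dataset("hiv_art_v2.csv", id_column="PatientID"))
```

The list of columns makes it easy to spot where variable names differ from Version 1, and the patient count can be compared with the figure quoted in the dataset description.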