7-9 May 2025
View the Project on GitHub Health-Research-From-Home/DataAnalysisChallenge
Smartphones and wearables provide many important opportunities to advance both clinical care and population health research. They allow the collection of new types of data including self-reported symptoms such as pain, or passively measured sensor data like physical activity. Quick surveys and passive data also allow us to measure the day-to -day changes in disease.
Chronic pain is one area of healthcare that can significantly benefit from these opportunities. Frequent data collection allows us to understand changing patterns of disease through time, identify (time-varying) factors that influence pain, measure and identify early signals of treatment response, or even predict flares and intervene in a timely way. However, collecting detailed, accurate data from patients is often time-intensive, costly, and challenging. Furthermore, developing and testing novel analysis methods to handle rich time-series data in order to achieve the above objectives can be hampered by the availability of large-scale datasets or incomplete data – either because of governance limitations or just because this is a relatively new field of research.
This hackathon focuses on designing and testing innovative approaches to simulate daily tracked symptom data and to predict pain dynamics, incorporating real-world challenges like optimising sampling frequency.
Below are the two tasks for the hackathon:
In this challenge, you’ll build a simulation that generates daily pain measurements for a set of patients over a certain period. Unlike a standard static dataset, this simulation should capture underlying patterns of disease that will be explained during the hackathon by clinical and patient partners. The end goal is to produce a realistic synthetic dataset that can be used in subsequent modeling tasks.
More details will be provided during the hackathon.
The result of your simulation will be evaluated by the domain experts. While there is no explicit “forecasting score,” the simulation should meet the following criteria:
You are encouraged to be creative in modeling elements with different parameters to enhance the simulator. The ultimate goal is to produce a coherent, flexible simulator that generates daily pain measurements influenced by mock treatments over a certain time period.
Imagine you have data on a number of patients, each with daily pain measurements recorded over a certain time period. However, in the real world, collecting patient-reported outcomes can be demanding and time-consuming. Your mission is to forecast patients’ pain measurement with high accuracy—while also minimizing the number of data points you request to retrieve.
More details will be provided during the hackathon.