User Guide
Comprehensive instructions for using the Motion Health Predictor platform effectively
Introduction
Welcome to the Motion Health Predictor! This advanced web-based tool is designed for researchers, healthcare professionals, and individuals to assess health risks based on physical activity patterns. Our platform utilizes state-of-the-art LSTM and Self-Attention deep learning algorithms to provide accurate, interpretable health risk predictions across 327 ICD-10 coded diseases.
The system analyzes 192-dimensional movement data representing activity proportions across weekdays and weekends, including four activity categories: sedentary behavior, light activity, moderate-vigorous activity, and sleep patterns.
Frequently Asked Questions
A: The platform offers two main pathways for health assessment:
Personal Assessment Pathway:
- Navigate to the "Personal Assessment" page
- Input basic information (age, gender, BMI)
- Choose between PPDM (basic) or PPDM+ (enhanced with blood tests) model
- Upload movement data CSV file or use example data
- Click "Assess Risk" for comprehensive health evaluation
Professional Analysis Pathway:
- Access model performance metrics and feature importance
- View AUC, sensitivity, specificity across all 327 diseases
- Analyze model interpretability with SHAP values
A: The system requires specific CSV formats depending on the chosen model:
• 196 columns total
• Columns 1-4: Basic Information (Participant ID, Age, Gender, BMI)
• Columns 5-100: Weekday activity data (96 values)
• Columns 101-196: Weekend activity data (96 values)
Activity Data Structure (per hour):
• Column 1: Sedentary proportion (0-1)
• Column 2: Light activity proportion (0-1)
• Column 3: Moderate-vigorous proportion (0-1)
• Column 4: Sleep proportion (0-1)
• Additional blood test parameters:
- WBC, RBC, Hemoglobin, Platelets
- ALT, AST, Glucose, Cholesterol
• Can be uploaded separately or entered manually
All activity proportion values must be between 0 and 1, representing the fraction of each hour spent in that activity category.
A: The health risk assessment provides comprehensive, multi-dimensional results:
Risk Classification System:
Population Percentile Display: See how your risk compares to others in your demographic group with clear percentage rankings and risk interval displays.
Comprehensive Results Dashboard Includes:
A: Our platform utilizes cutting-edge machine learning architectures:
Core Technologies:
- LSTM Networks: Capture temporal patterns in 24-hour activity data across multiple days
- Self-Attention Mechanisms: Identify critical time periods and activity types
- SHAP Analysis: Provide transparent, interpretable feature importance
- LightGBM Ensemble: For disease-specific risk classification
Model Validation: All models undergo rigorous nested leave-one-site-out cross-validation with bootstrap resampling (1,000 iterations) to ensure reliability and generalizability.
Performance Metrics Reported: AUC, accuracy, sensitivity, specificity, precision, Youden index, and F1 score with 95% confidence intervals.
A: Our models demonstrate exceptional performance across multiple validation studies:
Validation Framework: Nested cross-validation across 22 recruitment centers ensures robust performance estimates and prevents overfitting.
Technical Support
For optimal experience and troubleshooting:
- Ensure CSV files follow the required format exactly
- Verify all activity values are between 0 and 1
- Check for exactly 196 columns in movement data files
- Use the example data template as a reference
- Contact support through the Contact page for assistance
- Review the Privacy & Terms for data handling policies