Add data preparation scripts for UK Biobank analysis
- Introduced `prepare_data.R` for merging disease and other data from CSV files. - Added `prepare_data.py` for processing UK Biobank data, including: - Mapping field IDs to human-readable names. - Handling date variables and converting them to offsets. - Processing disease events and constructing tabular features. - Splitting data into training, validation, and test sets. - Saving processed data to binary and CSV formats.
This commit is contained in:
1129
icd10_codes_mod.tsv
Normal file
1129
icd10_codes_mod.tsv
Normal file
File diff suppressed because it is too large
Load Diff
Reference in New Issue
Block a user