Files
DeepHealth/field_ids_enriched.csv
Jiarui Li 9ca8909e3a Add data preparation scripts for UK Biobank analysis
- Introduced `prepare_data.R` for merging disease and other data from CSV files.
- Added `prepare_data.py` for processing UK Biobank data, including:
  - Mapping field IDs to human-readable names.
  - Handling date variables and converting them to offsets.
  - Processing disease events and constructing tabular features.
  - Splitting data into training, validation, and test sets.
  - Saving processed data to binary and CSV formats.
2025-12-04 11:26:49 +08:00

2.7 KiB

1field_instancefull_namevar_name
231-0.0Sexsex
334-0.0Year of birthyear
448-0.0Waist circumferencewaist_circumference
549-0.0Hip circumferencehip_circumference
650-0.0Standing heightstanding_height
752-0.0Month of birthmonth
853-0.0Date of attending assessment centredate_of_assessment
974-0.0Fasting timefasting_time
10102-0.0Pulse rate automated readingpulse_rate
111239-0.0Current tobacco smokingsmoking
121558-0.0Alcohol intake frequency.alcohol
134079-0.0Diastolic blood pressure automated readingdbp
144080-0.0Systolic blood pressure automated readingsbp
1520150-0.0Forced expiratory volume in 1-second (FEV1) Best measurefev1_best
1620151-0.0Forced vital capacity (FVC) Best measurefvc_best
1720258-0.0FEV1/ FVC ratio Z-scorefev1_fvc_ratio
1821001-0.0Body mass index (BMI)bmi
1921003-0.0Age when attended assessment centreage_at_assessment
2030000-0.0White blood cell (leukocyte) countWBC
2130010-0.0Red blood cell (erythrocyte) countRBC
2230020-0.0Haemoglobin concentrationhemoglobin
2330030-0.0Haematocrit percentagehematocrit
2430040-0.0Mean corpuscular volumeMCV
2530050-0.0Mean corpuscular haemoglobinMCH
2630060-0.0Mean corpuscular haemoglobin concentrationMCHC
2730080-0.0Platelet countPc
2830100-0.0Mean platelet (thrombocyte) volumeMPV
2930120-0.0Lymphocyte countLymC
3030130-0.0Monocyte countMonC
3130140-0.0Neutrophill countNeuC
3230150-0.0Eosinophill countEosC
3330160-0.0Basophill countBasC
3430170-0.0Nucleated red blood cell countnRBC
3530250-0.0Reticulocyte countRC
3630260-0.0Mean reticulocyte volumeMRV
3730270-0.0Mean sphered cell volumeMSCV
3830280-0.0Immature reticulocyte fractionIRF
3930300-0.0High light scatter reticulocyte countHLSRC
4030500-0.0Microalbumin in urineMicU
4130510-0.0Creatinine (enzymatic) in urineCreaU
4230520-0.0Potassium in urinePotU
4330530-0.0Sodium in urineSodU
4430600-0.0AlbuminAlb
4530610-0.0Alkaline phosphataseALP
4630620-0.0Alanine aminotransferaseAlanine
4730630-0.0Apolipoprotein AApoA
4830640-0.0Apolipoprotein BApoB
4930650-0.0Aspartate aminotransferaseAA
5030660-0.0Direct bilirubinDBil
5130670-0.0UreaUrea
5230680-0.0CalciumCalcium
5330690-0.0CholesterolCholesterol
5430700-0.0CreatinineCreatinine
5530710-0.0C-reactive proteinCRP
5630720-0.0Cystatin CCystatinC
5730730-0.0Gamma glutamyltransferaseGGT
5830740-0.0GlucoseGlu
5930750-0.0Glycated haemoglobin (HbA1c)HbA1c
6030760-0.0HDL cholesterolHDL
6130770-0.0IGF-1IGF1
6230780-0.0LDL directLDL
6330790-0.0Lipoprotein ALpA
6430800-0.0OestradiolOestradiol
6530810-0.0PhosphatePhosphate
6630820-0.0Rheumatoid factorRheu
6730830-0.0SHBGSHBG
6830840-0.0Total bilirubinTotalBil
6930850-0.0TestosteroneTestosterone
7030860-0.0Total proteinTotalProtein
7130870-0.0TriglyceridesTri
7230880-0.0UrateUrate
7330890-0.0Vitamin DVitaminD
7440000-0.0Date of deathDeath