Refactor evaluation scripts for multi-GPU execution

- Removed `run_evaluations_multi_gpu.sh` script as it was redundant.
- Updated `run_experiments_multi_gpu.sh` to handle evaluation jobs instead of training.
- Changed command-line options to support evaluation-specific parameters.
- Implemented run directory discovery and validation for evaluation jobs.
- Enhanced logging to capture evaluation details and outputs.
- Added options for centralized output management and skipping existing results.
This commit is contained in:
2026-01-18 17:38:20 +08:00
parent b80d9a4256
commit 0057bc0dd9
3 changed files with 1391 additions and 413 deletions

1245
evaluate.py Normal file

File diff suppressed because it is too large Load Diff