Blame view

scripts/evaluation/tuning/README.md 1.46 KB
dba57642   tangwang   bayes调参计划
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
  # Coarse Fusion 长跑调参
  
  ## 启动一轮长跑
  
  ```bash
  ./scripts/evaluation/start_coarse_fusion_tuning_long.sh
  ```
  
  可用环境变量:
  
  ```bash
  MAX_EVALS=48 BATCH_SIZE=3 CANDIDATE_POOL_SIZE=512 \
  RUN_NAME=coarse_fusion_long_001 \
  ./scripts/evaluation/start_coarse_fusion_tuning_long.sh
  ```
  
  启动后会打印:
  
  - `run_name`
  - `pid`
  - `log`
  - `run_dir`
  
  默认搜索空间:
  
  - `scripts/evaluation/tuning/coarse_rank_fusion_space.yaml`
  
  默认 baseline seed:
  
  - `artifacts/search_evaluation/batch_reports/batch_20260415T150754Z_00b6a8aa3d.md`
  
  ## 查看进度
  
  ```bash
  tail -f artifacts/search_evaluation/tuning_launches/<run_name>.log
  cat artifacts/search_evaluation/tuning_runs/<run_name>/leaderboard.csv
  sed -n '1,200p' artifacts/search_evaluation/tuning_runs/<run_name>/summary.md
  ```
  
  实时记录文件:
  
  - `trials.jsonl`
  - `leaderboard.csv`
  - `summary.json`
  - `summary.md`
  
  ## 续跑
  
  ```bash
  ./scripts/evaluation/resume_coarse_fusion_tuning_long.sh <run_name>
  ```
  
  也可直接传完整目录:
  
  ```bash
  ./scripts/evaluation/resume_coarse_fusion_tuning_long.sh \
    artifacts/search_evaluation/tuning_runs/<run_name>
  ```
  
  ## 停止
  
  ```bash
  kill "$(cat artifacts/search_evaluation/tuning_launches/<run_name>.pid)"
  ```
  
  ## 说明
  
  - 每轮会自动写入 `config/config.yaml`
  - 每轮会自动执行 `./restart.sh backend`
  - 如果 `eval-web` 因 backend 重启不可用,调参器会尝试补拉起 `eval-web`
  - 默认不 `apply-best`,跑完后会恢复 baseline 配置