dba57642
tangwang
bayes调参计划
|
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
|
# Coarse Fusion 长跑调参
## 启动一轮长跑
```bash
./scripts/evaluation/start_coarse_fusion_tuning_long.sh
```
可用环境变量:
```bash
MAX_EVALS=48 BATCH_SIZE=3 CANDIDATE_POOL_SIZE=512 \
RUN_NAME=coarse_fusion_long_001 \
./scripts/evaluation/start_coarse_fusion_tuning_long.sh
```
启动后会打印:
- `run_name`
- `pid`
- `log`
- `run_dir`
默认搜索空间:
- `scripts/evaluation/tuning/coarse_rank_fusion_space.yaml`
默认 baseline seed:
- `artifacts/search_evaluation/batch_reports/batch_20260415T150754Z_00b6a8aa3d.md`
## 查看进度
```bash
tail -f artifacts/search_evaluation/tuning_launches/<run_name>.log
cat artifacts/search_evaluation/tuning_runs/<run_name>/leaderboard.csv
sed -n '1,200p' artifacts/search_evaluation/tuning_runs/<run_name>/summary.md
```
实时记录文件:
- `trials.jsonl`
- `leaderboard.csv`
- `summary.json`
- `summary.md`
## 续跑
```bash
./scripts/evaluation/resume_coarse_fusion_tuning_long.sh <run_name>
```
也可直接传完整目录:
```bash
./scripts/evaluation/resume_coarse_fusion_tuning_long.sh \
artifacts/search_evaluation/tuning_runs/<run_name>
```
## 停止
```bash
kill "$(cat artifacts/search_evaluation/tuning_launches/<run_name>.pid)"
```
## 说明
- 每轮会自动写入 `config/config.yaml`
- 每轮会自动执行 `./restart.sh backend`
- 如果 `eval-web` 因 backend 重启不可用,调参器会尝试补拉起 `eval-web`
- 默认不 `apply-best`,跑完后会恢复 baseline 配置
|