ai-saas / saas-search

09 Apr, 2026

2 commits

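36516857 feat(product_enrich): add enriched_taxonomy_attributes ...

field generation to the product enrichment module

- Add taxonomy attribute enrichment, following the same field structure and processing logic as enriched_attributes; only the prompt and the parsing dimensions differ
- Introduce the AnalysisSchema abstract class so that content enrichment (content) and taxonomy enrichment (taxonomy) share batching, caching, prompt construction, Markdown parsing, and normalization
- Refactor the existing enrichment pipeline in product_enrich.py, extracting the shared logic into functions such as _process_batch_for_schema and _parse_markdown_to_attributes to remove code duplication
- Add the taxonomy prompt template (TAXONOMY_ANALYSIS_PROMPT) and the Markdown table header definition (TAXONOMY_HEADERS) to product_enrich_prompts.py
- Fix the Markdown parser's handling of empty cells: the previous implementation skipped empty cells, which misaligned columns; it now keeps empty values so that sparse taxonomy attribute columns line up correctly
- Update build_index_content_fields in document_transformer.py to write enriched_taxonomy_attributes (zh/en) into the final index document
- Adjust the related unit tests (test_product_enrich_partial_mode.py and others) to cover the new field path; tests pass (14 passed)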
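
A minimal sketch of the empty-cell fix described above, assuming a pipe-delimited Markdown table. The function names `parse_markdown_row` and `parse_markdown_table` and the sample headers are hypothetical and are not taken from product_enrich.py.

```python
def parse_markdown_row(line: str) -> list[str]:
    """Split one Markdown table row into cells, keeping empty cells."""
    stripped = line.strip()
    # Remove only the outer pipes; empty inner cells must survive the split.
    if stripped.startswith("|"):
        stripped = stripped[1:]
    if stripped.endswith("|"):
        stripped = stripped[:-1]
    return [cell.strip() for cell in stripped.split("|")]


def parse_markdown_table(text: str, headers: list[str]) -> list[dict[str, str]]:
    """Map each data row onto the headers by position."""
    rows = []
    for line in text.splitlines():
        if "|" not in line:
            continue
        cells = parse_markdown_row(line)
        # Skip the separator row (---, :---:) and the header row itself.
        if all(c and set(c) <= set("-:") for c in cells):
            continue
        if cells == headers:
            continue
        # Pad or truncate so every row has exactly one cell per header.
        cells = (cells + [""] * len(headers))[: len(headers)]
        rows.append(dict(zip(headers, cells)))
    return rows
```

Filtering out empty strings after the split would shift `cotton` into the `color` column in a row such as `| 1 | | cotton |`; keeping them preserves the alignment.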

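Technical details:
- AnalysisSchema carries metadata such as schema_name, prompt_template, headers, and field_name_prefix
- Cache keys distinguish content from taxonomy: `enrich:{schema_name}:{product_id}`, which avoids cache pollution
- Taxonomy parsing uses the same nested structure as enriched_attributes: `{"attribute_key": "value"}`, and supports multi-row tables
- Batch size and retry logic stay the same as for the existing content enrichment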
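
The schema metadata and the cache key format above could be sketched as follows. This uses a frozen dataclass as a simplification of the abstract class the commit describes; the `cache_key` method name, the placeholder prompt strings, and the header and prefix values are assumptions for illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AnalysisSchema:
    """Metadata shared by the content and taxonomy enrichment paths."""
    schema_name: str
    prompt_template: str
    headers: tuple[str, ...]
    field_name_prefix: str

    def cache_key(self, product_id: str) -> str:
        # Key format from the commit message: enrich:{schema_name}:{product_id}
        return f"enrich:{self.schema_name}:{product_id}"


# Hypothetical instances; the real prompts and headers live in
# product_enrich_prompts.py and are not reproduced here.
CONTENT_SCHEMA = AnalysisSchema(
    schema_name="content",
    prompt_template="<content prompt>",
    headers=("name", "value"),
    field_name_prefix="enriched_attributes",
)
TAXONOMY_SCHEMA = AnalysisSchema(
    schema_name="taxonomy",
    prompt_template="<taxonomy prompt>",
    headers=("name", "value"),
    field_name_prefix="enriched_taxonomy_attributes",
)
```

Because the schema name is part of the key, the same product never shares a cache entry between the two enrichment paths.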

2026-04-09 12:35:13 +0800

78cdef1c Add field enriched_taxonomy_attributes

tangwang
2026-04-09 11:56:59 +0800

08 Apr, 2026

6 commits

f27a8d90 ES docs maintenance

tangwang
2026-04-08 16:47:20 +0800
8e081187 ERR scoring: change the stopping-probability formula to the general form p(t) = (2^t - 1) / 2^{max_grade}

tangwang
2026-04-08 15:50:55 +0800
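
For reference, the stopping probability above plugs into the standard Expected Reciprocal Rank definition as sketched here. The function names are hypothetical, and it is an assumption that the repository combines the terms in exactly this way.

```python
def stop_probability(grade: int, max_grade: int) -> float:
    """p(t) = (2^t - 1) / 2^max_grade for a relevance grade t."""
    return (2 ** grade - 1) / (2 ** max_grade)


def expected_reciprocal_rank(grades: list[int], max_grade: int) -> float:
    """ERR over a ranked list of grades (rank 1 first)."""
    err = 0.0
    p_continue = 1.0  # probability the user reaches the current rank
    for rank, grade in enumerate(grades, start=1):
        p_stop = stop_probability(grade, max_grade)
        err += p_continue * p_stop / rank
        p_continue *= 1.0 - p_stop
    return err
```

With a four-level scale graded 0 to 3, `max_grade` is 3, so a top-grade result stops the user with probability 7/8.
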
b8ffa7e9 update clip-as-service submodule pointer

tangwang
2026-04-08 15:40:50 +0800
d73ca84a refine eval case snapshots and rename relevance levels

tangwang
2026-04-08 15:39:47 +0800

1fdab52d This change adjusts the BM25 parameters used by the combined query. ...

Previously, both `b` and `k1` were set to `0.0`. The original intention
was to avoid two common issues in e-commerce search relevance:

1. Over-penalizing longer product titles
   In product search, a shorter title should not automatically rank
higher just because BM25 favors shorter fields. For example, for a query
like “遥控车”, a product whose title is simply “遥控车” is not
necessarily a better candidate than a product with a slightly longer but
more descriptive title. In practice, extremely short titles may even
indicate lower-quality catalog data.

2. Over-rewarding repeated occurrences of the same term
   For longer queries such as “遥控喷雾翻滚多功能车玩具车”, the default
BM25 behavior may give too much weight to a term that appears multiple
times (for example “遥控”), even when other important query terms such
as “喷雾” or “翻滚” are missing. This can cause products with repeated
partial matches to outrank products that actually cover more of the user
intent.

Setting both parameters to zero was an intentional way to suppress
length normalization and term-frequency amplification. However, after
introducing a `combined_fields` query, this configuration becomes too
aggressive. Since `combined_fields` scores multiple fields as a unified
relevance signal, completely disabling both effects may also remove
useful ranking information, especially when we still want documents
matching more query terms across fields to be distinguishable from
weaker matches.
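
To make the effect of the two parameters concrete, here is the textbook BM25 term-frequency factor as a small function. It is a sketch of the general formula, not the scoring code Elasticsearch runs, and the function name is hypothetical.

```python
def bm25_tf_factor(tf: int, field_len: float, avg_field_len: float,
                   k1: float, b: float) -> float:
    """Textbook BM25 term-frequency factor (the IDF part is omitted)."""
    if tf <= 0:
        return 0.0
    # b controls how strongly field length scales the saturation point.
    length_norm = 1.0 - b + b * (field_len / avg_field_len)
    # k1 controls how quickly repeated occurrences saturate.
    return tf * (k1 + 1.0) / (tf + k1 * length_norm)
```

With `k1 = 0` the factor is 1.0 for any matching term, regardless of how often it occurs or how long the field is, so only IDF separates candidates. Any positive `k1` restores a saturating gain from repeated terms, and any positive `b` restores a penalty for longer fields.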

This update therefore relaxes the previous setting and reintroduces a
controlled amount of BM25 normalization/scoring behavior. The goal is to
keep the original intent — avoiding short-title bias and excessive
repeated-term gain — while allowing the combined query to better
preserve meaningful relevance differences across candidates.

Expected effect:
- reduce the bias toward unnaturally short product titles
- limit score inflation caused by repeated occurrences of the same term
- improve ranking stability for `combined_fields` queries
- better reward candidates that cover more of the overall query intent,
  instead of those that only repeat a subset of terms
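
One way such a relaxed setting might be expressed is an index-level default BM25 similarity plus a `combined_fields` query body, sketched below as plain Python dicts. The `b` and `k1` values, the field names, and the helper name are hypothetical; the commit message does not state the new values or how the repository applies them.

```python
# Hypothetical values: small but non-zero, to illustrate "relaxed" settings.
BM25_B = 0.2
BM25_K1 = 0.5

# Index settings that override the default similarity for the whole index.
index_settings = {
    "settings": {
        "index": {
            "similarity": {
                "default": {"type": "BM25", "b": BM25_B, "k1": BM25_K1}
            }
        }
    }
}


def build_combined_fields_query(text: str, fields: list[str]) -> dict:
    """Build a combined_fields query body over the given fields."""
    return {
        "query": {
            "combined_fields": {
                "query": text,
                "fields": fields,
                "operator": "or",
            }
        }
    }


# Hypothetical field names with a boost on the title.
example_query = build_combined_fields_query("遥控车", ["title.zh^2", "brief.zh"])
```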

2026-04-08 14:39:54 +0800

dc22700d data/mai_jia_jing_ling/import_products/

tangwang
2026-04-08 08:54:37 +0800

07 Apr, 2026

4 commits

483a05d9 Adjust text search weights (lower qanchors, enriched_tags, enriched_attributes.value)

tangwang
2026-04-07 23:14:50 +0800
6e3e6770 suggest docs maintenance

tangwang
2026-04-07 22:14:59 +0800

9f33fe3c fix suggestion rebuild flow and es index creation ...

- consolidate suggestion rebuild flow into build_suggestions.sh via --rebuild and remove the redundant rebuild_suggestions.sh wrapper
- make suggestion versioned index names use microseconds and handle index-create retries/timeouts without false already_exists failures
- treat create requests as successful when the index was created server-side, then explicitly wait for shard readiness and surface allocation diagnostics
- clean up freshly created suggestion indices on rebuild failure to avoid leaving red orphan indices behind
- make rebuild smoke tests target the local backend by default, with SUGGESTIONS_SMOKE_BASE_URL as the explicit override
- add unit coverage for microsecond versioned index names and cleanup on unallocatable index failures
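
A minimal sketch of a microsecond-resolution versioned index name, as mentioned in the list above. The function name, the `_v` separator, and the timestamp layout are assumptions; the commit does not show the actual naming scheme.

```python
from datetime import datetime, timezone
from typing import Optional


def versioned_index_name(prefix: str, now: Optional[datetime] = None) -> str:
    """Return prefix_v<YYYYmmddHHMMSS><microseconds>."""
    now = now or datetime.now(timezone.utc)
    # %f is the zero-padded 6-digit microsecond field.
    return f"{prefix}_v{now.strftime('%Y%m%d%H%M%S%f')}"
```

Two rebuilds started within the same second then get different names, which avoids a retried create colliding with an index from the previous attempt.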

2026-04-07 12:43:04 +0800

e50924ed 1. tags -> enriched_tags ...
2. issues docs
tangwang
2026-04-07 11:45:15 +0800

04 Apr, 2026

2 commits

441f049d Evaluation system improvements, plus ...
```
Exact Match
High Relevant
Low Relevant
Irrelevant

to

Fully Relevant
Mostly Relevant
Weakly Relevant
Irrelevant
```
tangwang
2026-04-04 22:14:42 +0800
f5da42e6 Annotation prompt improvements

tangwang
2026-04-04 19:02:43 +0800

03 Apr, 2026

2 commits

ccbdf870 Include the enriched_attributes.value field in search

tangwang
2026-04-03 21:11:50 +0800
639bee0a Organize issues (evaluation framework, LTR log preparation, first look at FM fitting results)

tangwang
2026-04-03 08:17:41 +0800

02 Apr, 2026

4 commits

41345271 Docs update

tangwang
2026-04-02 19:46:27 +0800

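465f90e1 Add LTR data collection ...

The 54 training examples are currently split into 44 train + 10 test; results improve significantly on the training set but fall below the baseline on test.
Kept as infrastructure; it can be put to use later once the dataset is expanded.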

2026-04-02 16:20:45 +0800

a6d51aa7 eval

tangwang
2026-04-02 13:56:45 +0800
4d000c94 Tune fusion formula parameters

tangwang
2026-04-02 12:14:41 +0800

01 Apr, 2026

18 commits

30b490e1 Add ERR evaluation metric

tangwang
2026-04-01 22:18:40 +0800
7ddd4cb3 Evaluation scheme changed from three levels to four: Exact Match / High Relevant / Low Relevant / ...
Irrelevant
tangwang
2026-04-01 21:35:57 +0800
9df421ed Start parameter tuning based on the eval framework

tangwang
2026-04-01 20:05:22 +0800

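80f1e036 enriched_attributes are now aggregated by name; value.zh / value.en under the same name ...

are all merged into arrays, consistent with how qanchors / enriched_tags are handled and a better fit for the way your current ES mapping is populated. ES text fields natively support arrays, so a value such as value.zh: ["舒适", "无鞋带设计"] indexes normally; nested only describes the outer array of objects and does not prevent the inner language fields from holding arrays.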
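
The aggregation described above can be sketched as follows, assuming each input item has the shape `{"name": ..., "value": {"zh": ..., "en": ...}}`. The function name is hypothetical, and dropping duplicate values is an added assumption rather than something the commit states.

```python
def aggregate_attributes(items: list[dict]) -> list[dict]:
    """Group attributes by name and merge per-language values into arrays."""
    grouped: dict[str, dict] = {}
    for item in items:
        name = item["name"]
        entry = grouped.setdefault(name, {"name": name, "value": {}})
        for lang, text in item.get("value", {}).items():
            values = entry["value"].setdefault(lang, [])
            # Assumption: skip empty strings and exact duplicates.
            if text and text not in values:
                values.append(text)
    return list(grouped.values())
```

Each output entry is one nested object whose `value.zh` and `value.en` are arrays, which matches the shape shown in the commit message.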

2026-04-01 17:27:51 +0800

42024409 Evaluation framework: batch labeling

tangwang
2026-04-01 16:57:58 +0800
286e9b4f evaluation

tangwang
2026-04-01 16:10:30 +0800
310bb3bc eval tools

tangwang
2026-04-01 15:59:19 +0800
331861d5 Make the eval framework configurable

tangwang
2026-04-01 15:29:36 +0800
1c2ba48e eval tagger

tangwang
2026-04-01 14:43:00 +0800

cdd8ee3a Separate logging for the eval framework ...

Current behavior (using your paths); paths are relative to the repo root PROJECT_ROOT:
- Main eval log (INFO from the CLI + framework): logs/eval.log
- Full LLM prompts / raw responses: logs/verbose/eval_verbose.log

Implementation notes:

- constants.py: EVAL_LOG_DIR, EVAL_VERBOSE_LOG_DIR, EVAL_LOG_FILE, EVAL_VERBOSE_LOG_FILE.
- logging_setup.py: setup_eval_logging() attaches file + stderr handlers to the logger named search_eval and initializes only once; eval_framework.cli.main(), which build_annotation_set.py / serve_eval_web.py go through, calls it first.
- cli.py: the former print calls now use logging.info on search_eval.cli; on startup it writes one line, CLI start command=... log_file=..., to logs/eval.log.
- framework.py: rebuild-related print calls now use logging.info on search_eval.framework.
- clients.py: verbose output now goes to logs/verbose/eval_verbose.log; setup_eval_logging() is called the first time it is needed, and search_eval.info reports the verbose file path (print is no longer used).
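
A minimal sketch of a once-only logging setup along the lines described above. The `log_dir` parameter, the module-level flag, and the log format are assumptions; only the logger name `search_eval`, the file + stderr handlers, and the eval.log file name come from the commit message.

```python
import logging
import sys
from pathlib import Path

_INITIALIZED = False  # guards against attaching duplicate handlers


def setup_eval_logging(log_dir: str = "logs") -> logging.Logger:
    """Attach file + stderr handlers to 'search_eval' exactly once."""
    global _INITIALIZED
    logger = logging.getLogger("search_eval")
    if _INITIALIZED:
        return logger
    Path(log_dir).mkdir(parents=True, exist_ok=True)
    formatter = logging.Formatter("%(asctime)s %(name)s %(levelname)s %(message)s")
    handlers = (
        logging.FileHandler(Path(log_dir) / "eval.log", encoding="utf-8"),
        logging.StreamHandler(sys.stderr),
    )
    for handler in handlers:
        handler.setFormatter(formatter)
        logger.addHandler(handler)
    logger.setLevel(logging.INFO)
    _INITIALIZED = True
    return logger
```

Child loggers such as `search_eval.cli` and `search_eval.framework` need no handlers of their own, because their records propagate up to `search_eval`.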

2026-04-01 14:19:27 +0800

35ae3b29 Batch evaluation framework; recall parameter changes and improved stop conditions for LLM evaluation

tangwang
2026-04-01 11:47:33 +0800

dedd31c5 1. Number of "score 1" items in the search recall pool (DEFAULT_SEARCH_RECALL_TOP_K) ...

scripts/evaluation/eval_framework/constants.py: 500 → 200
In Rebuild, rerank_score: 1.0 for rank <= recall_n still applies according to this K.
2. LLM batch lower and upper bounds
Minimum batches: DEFAULT_REBUILD_MIN_LLM_BATCHES 20 → 10
Maximum batches: still 40 (unchanged)
3. Early-stop condition (_annotate_rebuild_batches)
After min_batches have run, for each batch:

A batch with no Exact (exact_n == 0) counts as a bad batch if either condition holds:
irrelevant_ratio >= 0.94
or (irrelevant + Low Relevant) / n >= 0.96 (weak relevance uses RELEVANCE_LOW)
Two consecutive bad batches trigger an early stop (previously three consecutive, with irrelevant > 0.92).

Batch logs now include low_ratio and irrelevant_plus_low_ratio; rebuild metadata now includes rebuild_irrel_low_combined_stop_ratio.

4. CLI
--search-recall-top-k help text now says the default is 200
--rebuild-min-batches help text now says the default is 10
--rebuild-irrelevant-stop-ratio / --rebuild-irrelevant-stop-streak help text now matches the new logic
Added --rebuild-irrel-low-combined-stop-ratio (default 0.96)
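
The early-stop rule in item 3 can be sketched as below. The function names and label strings are hypothetical, and it is an assumption that batches up to and including `min_batches` never count toward the bad-batch streak.

```python
RELEVANCE_EXACT = "Exact Match"
RELEVANCE_LOW = "Low Relevant"
RELEVANCE_IRRELEVANT = "Irrelevant"


def is_bad_batch(labels: list[str],
                 irrelevant_stop_ratio: float = 0.94,
                 irrel_low_combined_stop_ratio: float = 0.96) -> bool:
    """A batch is bad if it has no Exact label and is mostly irrelevant/low."""
    n = len(labels)
    if n == 0 or labels.count(RELEVANCE_EXACT) > 0:
        return False
    irrelevant_n = labels.count(RELEVANCE_IRRELEVANT)
    low_n = labels.count(RELEVANCE_LOW)
    return (irrelevant_n / n >= irrelevant_stop_ratio
            or (irrelevant_n + low_n) / n >= irrel_low_combined_stop_ratio)


def batches_run(batches: list[list[str]], min_batches: int = 10,
                max_batches: int = 40, stop_streak: int = 2) -> int:
    """Return how many batches are processed before stopping."""
    streak = 0
    done = 0
    for labels in batches[:max_batches]:
        done += 1
        if done <= min_batches:
            continue  # assumption: early batches are never checked
        streak = streak + 1 if is_bad_batch(labels) else 0
        if streak >= stop_streak:
            break
    return done
```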

2026-04-01 11:22:59 +0800

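90de78aa Bug fix: the enrich API became incompatible with the cache after API iterations ...

`indexer/product_enrich.py`, rather than adding yet another check.

There were two root causes: cache keys are reused by content, but the cached values still carried the old product's `id/title_input`; and internal analysis results had historically mixed `tags` and `enriched_tags`. As a result, whenever an old cache entry was hit, `build_index_content_fields()` discarded the result because the `id` did not match, and the external output ended up entirely empty.

The handling is now:
- Internal analysis results consistently use `tags` as the LLM/cache-layer field.
- Externally, it is mapped to `enriched_tags` only when `build_index_content_fields()` wraps the output, and `enriched_attributes` likewise consistently emits `name="enriched_tags"`.
- Cache reads normalize first: `enriched_tags` in old cache entries is converted to the internal `tags`, and the cache hit is rebound to the `id/title_input` of the current request's product.
- Cache writes also use the normalized internal structure, and empty content is no longer written to the cache.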
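
The read-side normalization and the empty-content check above can be sketched as follows. The function names and the flat dict shape of a cache entry are assumptions; the actual structure in indexer/product_enrich.py is not shown in the commit message.

```python
def normalize_cached_result(cached: dict, product_id: str, title_input: str) -> dict:
    """Convert a legacy cache entry to the internal shape and rebind it."""
    result = dict(cached)  # copy so the cached object is not mutated
    if "enriched_tags" in result:
        legacy_tags = result.pop("enriched_tags")
        # Keep an existing internal `tags` value if both keys are present.
        result.setdefault("tags", legacy_tags)
    # Rebind to the product of the current request, not the one that
    # originally populated the cache.
    result["id"] = product_id
    result["title_input"] = title_input
    return result


def has_content(result: dict) -> bool:
    """True if any field other than the identity fields is non-empty."""
    return any(value for key, value in result.items()
               if key not in ("id", "title_input"))
```

A caller would write to the cache only when `has_content` returns True, so an empty LLM response cannot mask later, better results.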

2026-04-01 11:06:23 +0800

a3734f13 Eval tasks: batch calls are not supported in the US region, so switch to online calls

tangwang
2026-04-01 10:40:32 +0800
a345b01f eval framework

tangwang
2026-04-01 10:00:45 +0800
46d94a05 Revise evaluation criteria

tangwang
2026-04-01 09:40:39 +0800
bdb65283 Annotation framework: batch annotation

tangwang
2026-04-01 09:34:12 +0800
167f33b4 Eval framework frontend

tangwang
2026-04-01 07:40:04 +0800

31 Mar, 2026

2 commits

d172c259 Eval framework

tangwang
2026-03-31 23:27:53 +0800
3ac1f8d1 Refine evaluation criteria

tangwang
2026-03-31 22:48:50 +0800