16 Apr, 2026

1 commit

  • This branch is intended to differ from master only by deployment configuration for the test machine.
    
    - Add `.env.test.example` as a secrets-free override snippet to be appended onto `.env`.
      - Pins TEI to GPU mode (`TEI_DEVICE=cuda`) with `float16` for performance.
      - Pins a Tesla T4 compatible TEI image (`text-embeddings-inference:turing-1.9`) to avoid
        compute-capability mismatch errors (T4=sm75 vs non-turing images compiled for sm80).
      - Keeps TEI request limits aligned with current service settings (`TEI_MAX_BATCH_TOKENS=2048`,
        `TEI_MAX_CLIENT_BATCH_SIZE=8`) and provides an example BGE-M3 snapshot path.
    - Extend `.env.example` with guidance on selecting the correct TEI image tag (`turing-*` for T4,
      `cuda-*` for Ampere+) and optional mirror repository override.
    
    No credentials are committed; `.env` remains local-only.
    
    Made-with: Cursor
    tangwang
     

14 Apr, 2026

3 commits


07 Apr, 2026

3 commits

  • tangwang
     
  • - consolidate suggestion rebuild flow into build_suggestions.sh via --rebuild and remove the redundant rebuild_suggestions.sh wrapper
    - make suggestion versioned index names use microseconds and handle index-create retries/timeouts without false already_exists failures
    - treat create requests as successful when the index was created server-side, then explicitly wait for shard readiness and surface allocation diagnostics
    - clean up freshly created suggestion indices on rebuild failure to avoid leaving red orphan indices behind
    - make rebuild smoke tests target the local backend by default, with SUGGESTIONS_SMOKE_BASE_URL as the explicit override
    - add unit coverage for microsecond versioned index names and cleanup on unallocatable index failures
    tangwang
     
  • 2. issues文档
    tangwang
     

04 Apr, 2026

2 commits


03 Apr, 2026

1 commit


02 Apr, 2026

4 commits


01 Apr, 2026

16 commits


31 Mar, 2026

10 commits