16 Apr, 2026

1 commit

  • This branch is intended to differ from master only by deployment configuration for the test machine.
    
    - Add `.env.test.example` as a secrets-free override snippet to be appended onto `.env`.
      - Pins TEI to GPU mode (`TEI_DEVICE=cuda`) with `float16` for performance.
      - Pins a Tesla T4 compatible TEI image (`text-embeddings-inference:turing-1.9`) to avoid
        compute-capability mismatch errors (T4=sm75 vs non-turing images compiled for sm80).
      - Keeps TEI request limits aligned with current service settings (`TEI_MAX_BATCH_TOKENS=2048`,
        `TEI_MAX_CLIENT_BATCH_SIZE=8`) and provides an example BGE-M3 snapshot path.
    - Extend `.env.example` with guidance on selecting the correct TEI image tag (`turing-*` for T4,
      `cuda-*` for Ampere+) and optional mirror repository override.
    
    No credentials are committed; `.env` remains local-only.
    
    Made-with: Cursor
    tangwang
     

14 Apr, 2026

4 commits


08 Apr, 2026

1 commit


07 Apr, 2026

4 commits


04 Apr, 2026

2 commits


03 Apr, 2026

2 commits


02 Apr, 2026

4 commits


01 Apr, 2026

18 commits


31 Mar, 2026

4 commits