16 Apr, 2026
1 commit
-
This branch is intended to differ from master only by deployment configuration for the test machine. - Add `.env.test.example` as a secrets-free override snippet to be appended onto `.env`. - Pins TEI to GPU mode (`TEI_DEVICE=cuda`) with `float16` for performance. - Pins a Tesla T4 compatible TEI image (`text-embeddings-inference:turing-1.9`) to avoid compute-capability mismatch errors (T4=sm75 vs non-turing images compiled for sm80). - Keeps TEI request limits aligned with current service settings (`TEI_MAX_BATCH_TOKENS=2048`, `TEI_MAX_CLIENT_BATCH_SIZE=8`) and provides an example BGE-M3 snapshot path. - Extend `.env.example` with guidance on selecting the correct TEI image tag (`turing-*` for T4, `cuda-*` for Ampere+) and optional mirror repository override. No credentials are committed; `.env` remains local-only. Made-with: Cursor
31 Mar, 2026
1 commit
-
Made-with: Cursor
21 Mar, 2026
1 commit
17 Mar, 2026
1 commit
09 Mar, 2026
2 commits
-
CNCLIP_DEVICE=cuda TEI_USE_GPU=1 ./scripts/service_ctl.sh start 搜索后端+indexer+测试前段+4个微服务 跑通
06 Mar, 2026
1 commit
22 Dec, 2025
1 commit
09 Dec, 2025
1 commit
18 Nov, 2025
1 commit
12 Nov, 2025
1 commit
08 Nov, 2025
1 commit