Langchain-Chatchat/requirements_lite.txt

# API requirements

# On Windows system, install the cuda version manually from https://pytorch.org/
# torch~=2.1.2
# torchvision~=0.16.2
# torchaudio~=2.1.2
# xformers==0.0.23.post1
# transformers==4.36.2
# sentence_transformers==2.2.2

langchain==0.0.352
langchain-experimental==0.0.47
pydantic==1.10.13
fschat==0.2.34
openai~=1.6.0
fastapi>=0.105
sse_starlette
nltk>=3.8.1
uvicorn>=0.24.0.post1
starlette~=0.27.0
unstructured[docx,csv]==0.11.0 # add pdf if need
python-magic-bin; sys_platform == 'win32'
SQLAlchemy==2.0.19
faiss-cpu~=1.7.4 # `conda install faiss-gpu -c conda-forge` if you want to accelerate with gpus
# accelerate==0.24.1
# spacy~=3.7.2
# PyMuPDF~=1.23.8
# rapidocr_onnxruntime~=1.3.8
requests>=2.31.0
pathlib>=1.0.1
pytest>=7.4.3
numexpr>=2.8.6 # max version for py38
strsimpy>=0.2.1
markdownify>=0.11.6
# tiktoken~=0.5.2
tqdm>=4.66.1
websockets>=12.0
numpy~=1.26.2
pandas~=2.1.4
# einops>=0.7.0
# transformers_stream_generator==0.0.4
# vllm==0.2.6; sys_platform == "linux"
httpx[brotli,http2,socks]~=0.25.2

# optional document loaders

rapidocr_paddle[gpu]>=1.3.0.post5 # gpu accelleration for ocr of pdf and image files
jq>=1.6.0 # for .json and .jsonl files. suggest `conda install jq` on windows
# html2text # for .enex files
beautifulsoup4~=4.12.2 # for .mhtml files
pysrt~=1.1.2

# Online api libs dependencies

zhipuai>=1.0.7，<=2.0.0 # zhipu
dashscope>=1.13.6 # qwen
# volcengine>=1.0.119 # fangzhou

# uncomment libs if you want to use corresponding vector store
# pymilvus>=2.3.4
# psycopg2==2.9.9
# pgvector>=0.2.4

# Agent and Search Tools

arxiv>=2.0.0
youtube-search>=2.1.2
duckduckgo-search>=3.9.9
metaphor-python>=0.1.23

# WebUI requirements

streamlit~=1.29.0 # do remember to add streamlit to environment variables if you use windows
streamlit-option-menu>=0.3.6
streamlit-chatbox==1.1.11
streamlit-modal>=0.1.0
streamlit-aggrid>=0.3.4.post3
httpx[brotli,http2,socks]>=0.25.2
watchdog>=3.0.0
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装

											
										
										
											2023-12-19 15:59:41 +08:00
+								# API requirements
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装; numexpr 限定 2.8.6 以兼容 py38

											
										
										
											2023-12-29 09:31:16 +08:00
+								# On Windows system, install the cuda version manually from https://pytorch.org/
 								# torch~=2.1.2
 								# torchvision~=0.16.2
 								# torchaudio~=2.1.2
 								# xformers==0.0.23.post1
 								# transformers==4.36.2
 								# sentence_transformers==2.2.2
 								langchain==0.0.352
 								langchain-experimental==0.0.47
-												依赖升级 (#2246)

大量依赖升级到最新版
测试环境：Ubuntu 22.04 LTS kernel 6.6.3 Python 3.10.12，Cuda12.3 update1
											
										
										
											2023-12-01 11:34:41 +08:00
+								pydantic==1.10.13
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装; numexpr 限定 2.8.6 以兼容 py38

											
										
										
											2023-12-29 09:31:16 +08:00
+								fschat==0.2.34
 								openai~=1.6.0
-												修复 Langchain 更新之后OpenAI在线embed无法正常使用问题

											
										
										
											2023-12-23 11:36:11 +08:00
+								fastapi>=0.105
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装

											
										
										
											2023-12-19 15:59:41 +08:00
+								sse_starlette
 								nltk>=3.8.1
-												依赖升级 (#2246)

大量依赖升级到最新版
测试环境：Ubuntu 22.04 LTS kernel 6.6.3 Python 3.10.12，Cuda12.3 update1
											
										
										
											2023-12-01 11:34:41 +08:00
+								uvicorn>=0.24.0.post1
-												支持lite模式：无需安装torch等重依赖，通过在线API实现LLM对话和搜索引擎对话 (#1860)

* move get_default_llm_model from webui to ApiRequest

增加API接口及其测试用例：
- /server/get_prompt_template: 获取服务器配置的 prompt 模板
- 增加知识库多线程访问测试用例

支持lite模式：无需安装torch等重依赖，通过在线API实现LLM对话和搜索引擎对话

* fix bug in server.api

---------

Co-authored-by: imClumsyPanda <littlepanda0716@gmail.com>
											
										
										
											2023-10-25 08:30:23 +08:00
+								starlette~=0.27.0
-												update reqs and server/utils

											
										
										
											2023-11-30 22:23:15 +08:00
+								unstructured[docx,csv]==0.11.0 # add pdf if need
-												支持在线 Embeddings, Lite 模式支持所有知识库相关功能 (#1924)

新功能：
- 支持在线 Embeddings：zhipu-api, qwen-api, minimax-api, qianfan-api
- API 增加 /other/embed_texts 接口
- init_database.py 增加 --embed-model 参数，可以指定使用的嵌入模型（本地或在线均可）
- 对于 FAISS 知识库，支持多向量库，默认位置：{KB_PATH}/vector_store/{embed_model}
- Lite 模式支持所有知识库相关功能。此模式下最主要的限制是：
  - 不能使用本地 LLM 和 Embeddings 模型
  - 知识库不支持 PDF 文件
- init_database.py 重建知识库时不再默认情况数据库表，增加 clear-tables 参数手动控制。
- API 和 WEBUI 中 score_threshold 参数范围改为 [0, 2]，以更好的适应在线嵌入模型

问题修复：
- API 中 list_config_models 会删除 ONLINE_LLM_MODEL 中的敏感信息，导致第二轮API请求错误

开发者：
- 统一向量库的识别：以（kb_name,embed_model）为判断向量库唯一性的依据，避免 FAISS 知识库缓存加载逻辑错误
- KBServiceFactory.get_service_by_name 中添加 default_embed_model 参数，用于在构建新知识库时设置 embed_model
- 优化 kb_service 中 Embeddings 操作：
  - 统一加载接口： server.utils.load_embeddings，利用全局缓存避免各处 Embeddings 传参
  - 统一文本嵌入接口：server.knowledge_base.kb_service.base.[embed_texts, embed_documents]
- 重写 normalize 函数，去除对 scikit-learn/scipy 的依赖 

											
										
										
											2023-10-31 14:26:50 +08:00
+								python-magic-bin; sys_platform == 'win32'
-												支持lite模式：无需安装torch等重依赖，通过在线API实现LLM对话和搜索引擎对话 (#1860)

* move get_default_llm_model from webui to ApiRequest

增加API接口及其测试用例：
- /server/get_prompt_template: 获取服务器配置的 prompt 模板
- 增加知识库多线程访问测试用例

支持lite模式：无需安装torch等重依赖，通过在线API实现LLM对话和搜索引擎对话

* fix bug in server.api

---------

Co-authored-by: imClumsyPanda <littlepanda0716@gmail.com>
											
										
										
											2023-10-25 08:30:23 +08:00
+								SQLAlchemy==2.0.19
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装; numexpr 限定 2.8.6 以兼容 py38

											
										
										
											2023-12-29 09:31:16 +08:00
+								faiss-cpu~=1.7.4 # `conda install faiss-gpu -c conda-forge` if you want to accelerate with gpus
 								# accelerate==0.24.1
 								# spacy~=3.7.2
 								# PyMuPDF~=1.23.8
 								# rapidocr_onnxruntime~=1.3.8
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装

											
										
										
											2023-12-19 15:59:41 +08:00
+								requests>=2.31.0
 								pathlib>=1.0.1
 								pytest>=7.4.3
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装; numexpr 限定 2.8.6 以兼容 py38

											
										
										
											2023-12-29 09:31:16 +08:00
+								numexpr>=2.8.6 # max version for py38
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装

											
										
										
											2023-12-19 15:59:41 +08:00
+								strsimpy>=0.2.1
 								markdownify>=0.11.6
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装; numexpr 限定 2.8.6 以兼容 py38

											
										
										
											2023-12-29 09:31:16 +08:00
+								# tiktoken~=0.5.2
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装

											
										
										
											2023-12-19 15:59:41 +08:00
+								tqdm>=4.66.1
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装; numexpr 限定 2.8.6 以兼容 py38

											
										
										
											2023-12-29 09:31:16 +08:00
+								websockets>=12.0
 								numpy~=1.26.2
 								pandas~=2.1.4
 								# einops>=0.7.0
 								# transformers_stream_generator==0.0.4
 								# vllm==0.2.6; sys_platform == "linux"
 								httpx[brotli,http2,socks]~=0.25.2
-												支持lite模式：无需安装torch等重依赖，通过在线API实现LLM对话和搜索引擎对话 (#1860)

* move get_default_llm_model from webui to ApiRequest

增加API接口及其测试用例：
- /server/get_prompt_template: 获取服务器配置的 prompt 模板
- 增加知识库多线程访问测试用例

支持lite模式：无需安装torch等重依赖，通过在线API实现LLM对话和搜索引擎对话

* fix bug in server.api

---------

Co-authored-by: imClumsyPanda <littlepanda0716@gmail.com>
											
										
										
											2023-10-25 08:30:23 +08:00
-												ocr 支持 GPU 加速（需要手动安装 rapidocr_paddle[gpu])；知识库支持 MHTML 和 Evernote 文件。 (#2265)

在 requirements 和 Wiki 中增加对可选文档加载器 SDK 的说明 ( close #2264 )

											
										
										
											2023-12-04 09:39:56 +08:00
+								# optional document loaders
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装; numexpr 限定 2.8.6 以兼容 py38

											
										
										
											2023-12-29 09:31:16 +08:00
 								rapidocr_paddle[gpu]>=1.3.0.post5 # gpu accelleration for ocr of pdf and image files
 								jq>=1.6.0 # for .json and .jsonl files. suggest `conda install jq` on windows
-												ocr 支持 GPU 加速（需要手动安装 rapidocr_paddle[gpu])；知识库支持 MHTML 和 Evernote 文件。 (#2265)

在 requirements 和 Wiki 中增加对可选文档加载器 SDK 的说明 ( close #2264 )

											
										
										
											2023-12-04 09:39:56 +08:00
+								# html2text # for .enex files
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装; numexpr 限定 2.8.6 以兼容 py38

											
										
										
											2023-12-29 09:31:16 +08:00
+								beautifulsoup4~=4.12.2 # for .mhtml files
 								pysrt~=1.1.2
-												ocr 支持 GPU 加速（需要手动安装 rapidocr_paddle[gpu])；知识库支持 MHTML 和 Evernote 文件。 (#2265)

在 requirements 和 Wiki 中增加对可选文档加载器 SDK 的说明 ( close #2264 )

											
										
										
											2023-12-04 09:39:56 +08:00
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装

											
										
										
											2023-12-19 15:59:41 +08:00
+								# Online api libs dependencies
-												依赖升级 (#2246)

大量依赖升级到最新版
测试环境：Ubuntu 22.04 LTS kernel 6.6.3 Python 3.10.12，Cuda12.3 update1
											
										
										
											2023-12-01 11:34:41 +08:00
-												修复 Langchain 更新之后OpenAI在线embed无法正常使用问题

											
										
										
											2023-12-23 11:36:11 +08:00
+								zhipuai>=1.0.7，<=2.0.0 # zhipu
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装; numexpr 限定 2.8.6 以兼容 py38

											
										
										
											2023-12-29 09:31:16 +08:00
+								dashscope>=1.13.6 # qwen
 								# volcengine>=1.0.119 # fangzhou
-												支持lite模式：无需安装torch等重依赖，通过在线API实现LLM对话和搜索引擎对话 (#1860)

* move get_default_llm_model from webui to ApiRequest

增加API接口及其测试用例：
- /server/get_prompt_template: 获取服务器配置的 prompt 模板
- 增加知识库多线程访问测试用例

支持lite模式：无需安装torch等重依赖，通过在线API实现LLM对话和搜索引擎对话

* fix bug in server.api

---------

Co-authored-by: imClumsyPanda <littlepanda0716@gmail.com>
											
										
										
											2023-10-25 08:30:23 +08:00
 								# uncomment libs if you want to use corresponding vector store
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装; numexpr 限定 2.8.6 以兼容 py38

											
										
										
											2023-12-29 09:31:16 +08:00
+								# pymilvus>=2.3.4
 								# psycopg2==2.9.9
-												依赖升级 (#2246)

大量依赖升级到最新版
测试环境：Ubuntu 22.04 LTS kernel 6.6.3 Python 3.10.12，Cuda12.3 update1
											
										
										
											2023-12-01 11:34:41 +08:00
+								# pgvector>=0.2.4
-												支持lite模式：无需安装torch等重依赖，通过在线API实现LLM对话和搜索引擎对话 (#1860)

* move get_default_llm_model from webui to ApiRequest

增加API接口及其测试用例：
- /server/get_prompt_template: 获取服务器配置的 prompt 模板
- 增加知识库多线程访问测试用例

支持lite模式：无需安装torch等重依赖，通过在线API实现LLM对话和搜索引擎对话

* fix bug in server.api

---------

Co-authored-by: imClumsyPanda <littlepanda0716@gmail.com>
											
										
										
											2023-10-25 08:30:23 +08:00
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装

											
										
										
											2023-12-19 15:59:41 +08:00
+								# Agent and Search Tools
 								arxiv>=2.0.0
 								youtube-search>=2.1.2
 								duckduckgo-search>=3.9.9
 								metaphor-python>=0.1.23
 								# WebUI requirements
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装; numexpr 限定 2.8.6 以兼容 py38

											
										
										
											2023-12-29 09:31:16 +08:00
+								streamlit~=1.29.0 # do remember to add streamlit to environment variables if you use windows
-												支持lite模式：无需安装torch等重依赖，通过在线API实现LLM对话和搜索引擎对话 (#1860)

* move get_default_llm_model from webui to ApiRequest

增加API接口及其测试用例：
- /server/get_prompt_template: 获取服务器配置的 prompt 模板
- 增加知识库多线程访问测试用例

支持lite模式：无需安装torch等重依赖，通过在线API实现LLM对话和搜索引擎对话

* fix bug in server.api

---------

Co-authored-by: imClumsyPanda <littlepanda0716@gmail.com>
											
										
										
											2023-10-25 08:30:23 +08:00
+								streamlit-option-menu>=0.3.6
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装; numexpr 限定 2.8.6 以兼容 py38

											
										
										
											2023-12-29 09:31:16 +08:00
+								streamlit-chatbox==1.1.11
-												修复Azure 不设置Max token的bug (#2254)


											
										
										
											2023-12-02 16:50:56 +08:00
+								streamlit-modal>=0.1.0
-												支持lite模式：无需安装torch等重依赖，通过在线API实现LLM对话和搜索引擎对话 (#1860)

* move get_default_llm_model from webui to ApiRequest

增加API接口及其测试用例：
- /server/get_prompt_template: 获取服务器配置的 prompt 模板
- 增加知识库多线程访问测试用例

支持lite模式：无需安装torch等重依赖，通过在线API实现LLM对话和搜索引擎对话

* fix bug in server.api

---------

Co-authored-by: imClumsyPanda <littlepanda0716@gmail.com>
											
										
										
											2023-10-25 08:30:23 +08:00
+								streamlit-aggrid>=0.3.4.post3
-												update requirements: 统一各文件依赖顺序，便于比对；移出streamlit-antd-components; jq默认安装; numexpr 限定 2.8.6 以兼容 py38

											
										
										
											2023-12-29 09:31:16 +08:00
+								httpx[brotli,http2,socks]>=0.25.2
-												修复Azure 不设置Max token的bug (#2254)


											
										
										
											2023-12-02 16:50:56 +08:00
+								watchdog>=3.0.0