1489 Commits

Author SHA1 Message Date
liunux4odoo a03b8d330d
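Upgrade to langchain==0.0.287, fschat==0.0.28; work around langchain.Milvus bug (#10492) (#1454)
* Fix several bugs in milvus_kb_service; sync data to the database after adding documents
* Upgrade to langchain==0.0.287, fschat==0.0.28; work around langchain.Milvus bug (#10492)
* Fix model-switching bug: error when switching models from an online API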
2023-09-13 08:43:11 +08:00
liunux4odoo efd6d4a251
Fix several bugs in milvus_kb_service; sync data to the database after adding documents (#1452) 2023-09-12 22:34:03 +08:00
imClumsyPanda 4aa14b859e
Add ChineseRecursiveTextSplitter (#1447)
* add RapidOCRPDFLoader

* update mypdfloader.py and requirements.txt

* add myimgloader.py

* add test samples

* add TODO to mypdfloader

* add loaders to KnowledgeFile class

* add loaders to KnowledgeFile class

* add ChineseRecursiveTextSplitter

* add ChineseRecursiveTextSplitter
2023-09-12 17:38:52 +08:00
imClumsyPanda 1d0c6a9184 Merge branch 'master' into pre-release 2023-09-12 15:51:53 +08:00
imClumsyPanda 2020f5bb88 merge master 2023-09-12 15:51:28 +08:00
liunux4odoo 03c021d50f
Add MiniMax online API support (#1445) 2023-09-12 15:24:47 +08:00
imClumsyPanda 8064586374 delete .doc from supported formats 2023-09-12 15:22:18 +08:00
liunux4odoo 8a12b9c658
Add an action that automatically closes issues (#1437) 2023-09-12 10:16:19 +08:00
liunux4odoo 60f88ae92e
fix: noticeable startup delay caused by server.utils pulling in fastchat model_worker when it imports model_workers (#1436) 2023-09-12 08:52:00 +08:00
liunux4odoo 6a03611750
fix api and webui: (#1435)
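1. fix #1431: optimize the API endpoints for knowledge base chat and search engine chat so that docs are not returned twice
2. startup.py sets the log level according to configs.log_verbose
3. Fix a bug in /llm_model/list_models:
   with only one parameter, fastapi did not return JSON, causing the view function to fail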
2023-09-12 08:31:17 +08:00
liunux4odoo 22ff073309
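Change how Embeddings and FAISS caches are loaded; knowledge base API endpoints support multithreaded concurrency (#1434)
* Change how Embeddings and FAISS caches are loaded; support multithreading and in-memory FAISS

* Knowledge base API endpoints support multithreaded concurrency

* Adjust ApiRequest and test cases for the new API endpoints

* Remove the obsolete startup instructions from webui.py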
2023-09-11 20:41:41 +08:00
imClumsyPanda 518d3c4951
Update README.md 2023-09-11 15:18:03 +08:00
imClumsyPanda be621772ab
Add files via upload 2023-09-11 15:17:38 +08:00
Zhi-guo Huang d0e654d847
Merge pull request #1422 from hzg0601/dev
update readme.md, readme_cn.md: update the subscription account QR code
2023-09-09 17:36:10 +08:00
hzg0601 ed6df7cd24 update readme.md, readme_cn.md: update the subscription account QR code 2023-09-09 17:34:56 +08:00
glide-the b0a9d8f30e Add display of OCR recognition progress 2023-09-08 21:32:41 +08:00
glide-the 60e6887e94
Merge pull request #1419 from glide-the/dev
Add a switch for showing verbose logs
2023-09-08 20:50:31 +08:00
glide-the 5550e3c1a9 Add a switch for showing verbose logs 2023-09-08 20:48:31 +08:00
liunux4odoo 228fc5bd93 fix startup.py: correct processes count 2023-09-08 15:24:43 +08:00
liunux4odoo f082edbaad fix startup.py: correct processes count 2023-09-08 15:23:54 +08:00
liunux4odoo f94f2793f8
update startup.py: (#1416)
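1. Restore the model-switching feature
2. --model-name accepts multiple names (space-separated) to start several models at once
3. Optimize the service startup order. Start strictly in sequence: controller -> [openai-api,
   model_worker, api_worker] in parallel -> api.py -> webui.py
4. Fix: switching from an online API model to a local model failed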
2023-09-08 15:18:13 +08:00
Kyle Zhang 6e9a7e0c89
chore: update .gitignore (#1364)
Added '/' before knowledge_base/ and configs/*.py to make them match
directories at root folder only.

Signed-off-by: Kyle Zhang <zhangk1985@gmail.com>
Co-authored-by: imClumsyPanda <littlepanda0716@gmail.com>
2023-09-08 13:29:40 +08:00
liunux4odoo 775870a516
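Change the sync/async mode of api view functions to improve API concurrency: (#1414)
1. The 4 chat endpoints are changed to async
2. Knowledge base operations: those that modify the vector store use async to avoid FAISS write errors; those that only read the vector store are changed to sync to improve concurrency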
2023-09-08 12:25:02 +08:00
roki1031 196bde3ead
Note that the fitz package comes from pyMuPDF (#1407) 2023-09-08 12:20:20 +08:00
liunux4odoo 1195eb75eb merge pr1413 2023-09-08 10:30:07 +08:00
Zhi-guo Huang f2aed727c2
Merge pull request #1412 from hzg0601/dev
update reamd_en.md: remove the parallel Chinese text
2023-09-08 10:22:24 +08:00
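liunux4odoo 4cfee9c17c Update ApiRequest, webui, and test cases for the new endpoints. Batch knowledge file operations in webui are expected to take less time after this change 2023-09-08 10:22:04 +08:00
hzg0601 136b667683 update reamd_en.md: remove the parallel Chinese text 2023-09-08 10:19:49 +08:00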
Zhi-guo Huang 71a1ae0627
Merge pull request #1411 from hzg0601/dev
Add a bilingual (Chinese-English) reame_en.md; update readme.md
2023-09-08 10:01:57 +08:00
hzg0601 607831496b update readme.md 2023-09-08 09:54:11 +08:00
hzg0601 bf748eb732 add README_en.md: add a bilingual (Chinese-English) readme 2023-09-08 09:46:23 +08:00
hzg0601 10237d24ca update readme_cn.md 2023-09-08 09:32:33 +08:00
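liunux4odoo 661a0e9d72 New features:
- add_docs/delete_docs/update_docs in knowledge base management all support batch operations and use multithreading for efficiency
- The API's rebuild-knowledge-base endpoint supports multithreading
- add_docs accepts a parameter controlling whether to continue with vectorization after files are uploaded
- add_docs/update_docs accept custom docs (as JSON). Distinguishing full from supplementary custom docs may be considered later
- The download_doc endpoint adds a `preview` parameter, supporting download or preview
- kb_service adds a `save_vector_store` method to make saving the vector store easier (FAISS only; a no-op for others)
- Extract the document_loader & text_splitter logic from KnowledgeFile, in preparation for vectorizing in-memory files later
- KowledgeFile supports caching of docs & splitted_docs, which makes custom processing in intermediate steps easier

Other:
- Change some error output from print to logger.error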
2023-09-08 08:55:12 +08:00
Light f422575a17
Fix bug where the chat endpoint displayed KeyError: 'answer' when reporting an error (#1399)
Co-authored-by: Light <>
2023-09-07 20:44:21 +08:00
hzg0601 80f00e27f9 update readme.md, add reame_en.md: 1. update supported models list;2. add readme_en.md 2023-09-07 16:27:12 +08:00
imClumsyPanda 6ea4fc7a30 Merge branch 'master' into dev 2023-09-06 21:19:59 +08:00
imClumsyPanda 9ed43ab2ab Merge branch 'master' into pre-release 2023-09-06 21:19:38 +08:00
imClumsyPanda 54cf79f521 add support for new embedding models sensenova/piccolo-base-zh and sensenova/piccolo-large-zh 2023-09-06 21:17:32 +08:00
imClumsyPanda eeb152e22c Merge branch 'master' into dev 2023-09-06 20:09:53 +08:00
imClumsyPanda 59bdfd85ea Merge branch 'master' into pre-release 2023-09-06 20:09:36 +08:00
imClumsyPanda f607424c11 fix pdf ocr bug 2023-09-06 20:09:18 +08:00
imClumsyPanda 673410ecd5 fix pdf ocr bug 2023-09-06 20:08:56 +08:00
imClumsyPanda 8ec3db94f7
Update README.md 2023-09-06 20:04:03 +08:00
imClumsyPanda e9b25e3f14
Add files via upload 2023-09-06 20:03:41 +08:00
hzg0601 76c2c61bb2 update server.model_worker.zhipu.py: update the prompt to follow zhipu's format requirements 2023-09-06 17:33:00 +08:00
hzg0601 5e4bd5c3d3 update server.model_worker.zhipu.py/base.py: add streaming responses 2023-09-06 11:16:16 +08:00
jinke bc0a39ab82 update docker desc
Update docker for 0.2.3
2023-09-06 10:23:34 +08:00
hzg0601 6f039cfdeb update server.model_workers.base.py/zhipu.py: support streaming output for zhipuapi 2023-09-05 23:13:42 +08:00
imClumsyPanda 0ea18314f2 fix unit test name 2023-09-05 21:08:02 +08:00
glide-the 082f429809
Merge pull request #1382 from glide-the/dev
Add a switch for building document metadata retrieval; move it to dev_metadata_search
2023-09-05 21:02:18 +08:00