Langchain-Chatchat/server/chat/chat.py

from fastapi import Body
from fastapi.responses import StreamingResponse
from configs import LLM_MODEL, TEMPERATURE
from server.utils import wrap_done, get_ChatOpenAI
from langchain import LLMChain
from langchain.callbacks import AsyncIteratorCallbackHandler
from typing import AsyncIterable
import asyncio
from langchain.prompts.chat import ChatPromptTemplate
from typing import List
from server.chat.utils import History
from server.utils import get_prompt_template


async def chat(query: str = Body(..., description="用户输入", examples=["恼羞成怒"]),
                history: List[History] = Body([],
                                       description="历史对话",
                                       examples=[[
                                           {"role": "user", "content": "我们来玩成语接龙，我先来，生龙活虎"},
                                           {"role": "assistant", "content": "虎头虎脑"}]]
                                       ),
                stream: bool = Body(False, description="流式输出"),
                model_name: str = Body(LLM_MODEL, description="LLM 模型名称。"),
                temperature: float = Body(TEMPERATURE, description="LLM 采样温度", ge=0.0, le=1.0),
                # top_p: float = Body(TOP_P, description="LLM 核采样。勿与temperature同时设置", gt=0.0, lt=1.0),
                prompt_name: str = Body("llm_chat", description="使用的prompt模板名称(在configs/prompt_config.py中配置)"),
         ):
    history = [History.from_data(h) for h in history]

    async def chat_iterator(query: str,
                            history: List[History] = [],
                            model_name: str = LLM_MODEL,
                            prompt_name: str = prompt_name,
                            ) -> AsyncIterable[str]:
        callback = AsyncIteratorCallbackHandler()
        model = get_ChatOpenAI(
            model_name=model_name,
            temperature=temperature,
            callbacks=[callback],
        )

        prompt_template = get_prompt_template(prompt_name)
        input_msg = History(role="user", content=prompt_template).to_msg_template(False)
        chat_prompt = ChatPromptTemplate.from_messages(
            [i.to_msg_template() for i in history] + [input_msg])
        chain = LLMChain(prompt=chat_prompt, llm=model)

        # Begin a task that runs in the background.
        task = asyncio.create_task(wrap_done(
            chain.acall({"input": query}),
            callback.done),
        )

        if stream:
            async for token in callback.aiter():
                # Use server-sent-events to stream the response
                yield token
        else:
            answer = ""
            async for token in callback.aiter():
                answer += token
            yield answer

        await task

    return StreamingResponse(chat_iterator(query=query,
                                           history=history,
                                           model_name=model_name,
                                           prompt_name=prompt_name),
                             media_type="text/event-stream")
v0.2.0 first commit 2023-07-27 23:22:07 +08:00			`from fastapi import Body`
			`from fastapi.responses import StreamingResponse`
添加configs/prompt_config.py，允许用户自定义prompt模板： (#1504) 1、默认包含2个模板，分别用于LLM对话，知识库和搜索引擎对话 2、 server/utils.py提供函数get_prompt_template，获取指定的prompt模板内容（支持热加载） 3、 api.py中chat/knowledge_base_chat/search_engine_chat接口支持prompt_name参数 2023-09-17 13:27:11 +08:00			`from configs import LLM_MODEL, TEMPERATURE`
move wrap_done & get_ChatOpenAI from server.chat.utils to server.utils (#1506) 2023-09-17 16:19:50 +08:00			`from server.utils import wrap_done, get_ChatOpenAI`
v0.2.0 first commit 2023-07-27 23:22:07 +08:00			`from langchain import LLMChain`
			`from langchain.callbacks import AsyncIteratorCallbackHandler`
			`from typing import AsyncIterable`
			`import asyncio`
add history to chat apis 2023-08-08 23:54:51 +08:00			`from langchain.prompts.chat import ChatPromptTemplate`
update import pkgs and format 2023-08-10 21:26:05 +08:00			`from typing import List`
add history to chat apis 2023-08-08 23:54:51 +08:00			`from server.chat.utils import History`
添加configs/prompt_config.py，允许用户自定义prompt模板： (#1504) 1、默认包含2个模板，分别用于LLM对话，知识库和搜索引擎对话 2、 server/utils.py提供函数get_prompt_template，获取指定的prompt模板内容（支持热加载） 3、 api.py中chat/knowledge_base_chat/search_engine_chat接口支持prompt_name参数 2023-09-17 13:27:11 +08:00			`from server.utils import get_prompt_template`
v0.2.0 first commit 2023-07-27 23:22:07 +08:00

改变api视图函数的sync/async，提高api并发能力： (#1414) 1. 4个chat类接口改为async 2. 知识库操作，涉及向量库修改的使用async，避免FAISS写入错误；涉及向量库读取的改为sync，提高并发 2023-09-08 12:25:02 +08:00			`async def chat(query: str = Body(..., description="用户输入", examples=["恼羞成怒"]),`
			`history: List[History] = Body([],`
update import pkgs and format 2023-08-10 21:26:05 +08:00			`description="历史对话",`
			`examples=[[`
			`{"role": "user", "content": "我们来玩成语接龙，我先来，生龙活虎"},`
			`{"role": "assistant", "content": "虎头虎脑"}]]`
			`),`
对话接口支持temperature参数 (#1455) 2023-09-13 10:00:54 +08:00			`stream: bool = Body(False, description="流式输出"),`
			`model_name: str = Body(LLM_MODEL, description="LLM 模型名称。"),`
Dev (#1613) * 增加了仅限GPT4的agent功能，陆续补充，中文版readme已写 * issue提到的一个bug * 温度最小改成0，但是不应该支持负数 * 修改了最小的温度 2023-09-27 21:17:50 +08:00			`temperature: float = Body(TEMPERATURE, description="LLM 采样温度", ge=0.0, le=1.0),`
对话接口支持temperature参数 (#1455) 2023-09-13 10:00:54 +08:00			`# top_p: float = Body(TOP_P, description="LLM 核采样。勿与temperature同时设置", gt=0.0, lt=1.0),`
添加configs/prompt_config.py，允许用户自定义prompt模板： (#1504) 1、默认包含2个模板，分别用于LLM对话，知识库和搜索引擎对话 2、 server/utils.py提供函数get_prompt_template，获取指定的prompt模板内容（支持热加载） 3、 api.py中chat/knowledge_base_chat/search_engine_chat接口支持prompt_name参数 2023-09-17 13:27:11 +08:00			`prompt_name: str = Body("llm_chat", description="使用的prompt模板名称(在configs/prompt_config.py中配置)"),`
add history to chat apis 2023-08-08 23:54:51 +08:00			`):`
fix #1142: 在History中使用jinja2模板代替f-string，避免消息中含有{ }时出错 2023-08-23 08:35:26 +08:00			`history = [History.from_data(h) for h in history]`
update import pkgs and format 2023-08-10 21:26:05 +08:00
add history to chat apis 2023-08-08 23:54:51 +08:00			`async def chat_iterator(query: str,`
update server.chat.*: set default value [] to history parameter. 2023-08-09 10:48:37 +08:00			`history: List[History] = [],`
添加切换模型功能，支持智谱AI在线模型 (#1342) * 添加LLM模型切换功能，需要在server_config中设置可切换的模型 * add tests for api.py/llm_model/* * - 支持模型切换 - 支持智普AI线上模型 - startup.py增加参数`--api-worker`，自动运行所有的线上API模型。使用`-a (--all-webui), --all-api`时默认开启该选项 * 修复被fastchat覆盖的标准输出 * 对fastchat日志进行更细致的控制，startup.py中增加-q(--quiet)开关，可以减少无用的fastchat日志输出 * 修正chatglm api的对话模板 Co-authored-by: liunux4odoo <liunu@qq.com> 2023-09-01 23:58:09 +08:00			`model_name: str = LLM_MODEL,`
添加configs/prompt_config.py，允许用户自定义prompt模板： (#1504) 1、默认包含2个模板，分别用于LLM对话，知识库和搜索引擎对话 2、 server/utils.py提供函数get_prompt_template，获取指定的prompt模板内容（支持热加载） 3、 api.py中chat/knowledge_base_chat/search_engine_chat接口支持prompt_name参数 2023-09-17 13:27:11 +08:00			`prompt_name: str = prompt_name,`
add history to chat apis 2023-08-08 23:54:51 +08:00			`) -> AsyncIterable[str]:`
v0.2.0 first commit 2023-07-27 23:22:07 +08:00			`callback = AsyncIteratorCallbackHandler()`
优化configs (#1474) * remove llm_model_dict * optimize configs * fix get_model_path * 更改一些默认参数，添加千帆的默认配置 * Update server_config.py.example 2023-09-15 17:52:22 +08:00			`model = get_ChatOpenAI(`
添加切换模型功能，支持智谱AI在线模型 (#1342) * 添加LLM模型切换功能，需要在server_config中设置可切换的模型 * add tests for api.py/llm_model/* * - 支持模型切换 - 支持智普AI线上模型 - startup.py增加参数`--api-worker`，自动运行所有的线上API模型。使用`-a (--all-webui), --all-api`时默认开启该选项 * 修复被fastchat覆盖的标准输出 * 对fastchat日志进行更细致的控制，startup.py中增加-q(--quiet)开关，可以减少无用的fastchat日志输出 * 修正chatglm api的对话模板 Co-authored-by: liunux4odoo <liunu@qq.com> 2023-09-01 23:58:09 +08:00			`model_name=model_name,`
对话接口支持temperature参数 (#1455) 2023-09-13 10:00:54 +08:00			`temperature=temperature,`
优化configs (#1474) * remove llm_model_dict * optimize configs * fix get_model_path * 更改一些默认参数，添加千帆的默认配置 * Update server_config.py.example 2023-09-15 17:52:22 +08:00			`callbacks=[callback],`
v0.2.0 first commit 2023-07-27 23:22:07 +08:00			`)`
添加configs/prompt_config.py，允许用户自定义prompt模板： (#1504) 1、默认包含2个模板，分别用于LLM对话，知识库和搜索引擎对话 2、 server/utils.py提供函数get_prompt_template，获取指定的prompt模板内容（支持热加载） 3、 api.py中chat/knowledge_base_chat/search_engine_chat接口支持prompt_name参数 2023-09-17 13:27:11 +08:00
			`prompt_template = get_prompt_template(prompt_name)`
			`input_msg = History(role="user", content=prompt_template).to_msg_template(False)`
add history to chat apis 2023-08-08 23:54:51 +08:00			`chat_prompt = ChatPromptTemplate.from_messages(`
fix #1142: 在History中使用jinja2模板代替f-string，避免消息中含有{ }时出错 2023-08-23 08:35:26 +08:00			`[i.to_msg_template() for i in history] + [input_msg])`
add history to chat apis 2023-08-08 23:54:51 +08:00			`chain = LLMChain(prompt=chat_prompt, llm=model)`
v0.2.0 first commit 2023-07-27 23:22:07 +08:00
			`# Begin a task that runs in the background.`
			`task = asyncio.create_task(wrap_done(`
add history to chat apis 2023-08-08 23:54:51 +08:00			`chain.acall({"input": query}),`
v0.2.0 first commit 2023-07-27 23:22:07 +08:00			`callback.done),`
			`)`

fix chat and knowledge_base_chat 2023-08-14 10:35:47 +08:00			`if stream:`
			`async for token in callback.aiter():`
			`# Use server-sent-events to stream the response`
			`yield token`
			`else:`
			`answer = ""`
			`async for token in callback.aiter():`
			`answer += token`
			`yield answer`

v0.2.0 first commit 2023-07-27 23:22:07 +08:00			`await task`
add history to chat apis 2023-08-08 23:54:51 +08:00
添加configs/prompt_config.py，允许用户自定义prompt模板： (#1504) 1、默认包含2个模板，分别用于LLM对话，知识库和搜索引擎对话 2、 server/utils.py提供函数get_prompt_template，获取指定的prompt模板内容（支持热加载） 3、 api.py中chat/knowledge_base_chat/search_engine_chat接口支持prompt_name参数 2023-09-17 13:27:11 +08:00			`return StreamingResponse(chat_iterator(query=query,`
			`history=history,`
			`model_name=model_name,`
			`prompt_name=prompt_name),`
add history to chat apis 2023-08-08 23:54:51 +08:00			`media_type="text/event-stream")`