44 lines
818 B
Markdown
44 lines
818 B
Markdown
|
|
# 大模型技术栈-实战与应用
|
||
|
|
- 训练框架
|
||
|
|
- deepspeed
|
||
|
|
- megatron-lm
|
||
|
|
- colossal-ai
|
||
|
|
- trlx
|
||
|
|
- 推理框架
|
||
|
|
- triton
|
||
|
|
- vllm
|
||
|
|
- text-generation-inference
|
||
|
|
- lit-llama
|
||
|
|
- lightllm
|
||
|
|
- TensorRT-LLM(原FasterTransformer)
|
||
|
|
- fastllm
|
||
|
|
- inferllm
|
||
|
|
- llama-cpp
|
||
|
|
- openPPL-LLM
|
||
|
|
- 压缩框架
|
||
|
|
- bitsandbytes
|
||
|
|
- auto-gptq
|
||
|
|
- deepspeed
|
||
|
|
- embedding框架
|
||
|
|
- sentence-transformer
|
||
|
|
- FlagEmbedding
|
||
|
|
- 向量数据库 [向量数据库对比]("https://www.jianshu.com/p/43cc19426113")
|
||
|
|
- faiss
|
||
|
|
- pgvector
|
||
|
|
- milvus
|
||
|
|
- pinecone
|
||
|
|
- weaviate
|
||
|
|
- LanceDB
|
||
|
|
- Chroma
|
||
|
|
- 应用框架
|
||
|
|
- Auto-GPT
|
||
|
|
- langchain
|
||
|
|
- llama-index
|
||
|
|
- quivr
|
||
|
|
- python前端
|
||
|
|
- streamlit
|
||
|
|
- gradio
|
||
|
|
- python API工具
|
||
|
|
- FastAPI+uvicorn
|
||
|
|
- flask
|
||
|
|
- Django
|