Tommy 42d986b96d [Pixtral] Add new model ; add vision (#11231) 1 year ago
..
__base c5f7d650b5 feat: Allow using file variables directly in the LLM node and support more file types. (#10679) 1 year ago
anthropic 02572e8cca fix: claude can not handle empty string (#11238) 1 year ago
azure_ai_studio 51db59622c chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425) 1 year ago
azure_openai 6f9ce6a199 fix: fix azure open-4o-08-06 when enable json schema cant process content = "" (#11204) 1 year ago
baichuan b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 1 year ago
bedrock f54225568c fix(model_runtime): add vision to Amazon Nova Lite and Pro (#11398) 1 year ago
chatglm 40fb4d16ef chore: refurbish Python code by applying refurb linter rules (#8296) 1 year ago
cohere 5093337de1 FEAT: cohere rerank 3.5 model added (#11289) 1 year ago
deepseek aae29e72ae Fix Deepseek Function/Tool Calling (#11023) 1 year ago
fireworks b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 1 year ago
fishaudio 448a19bf54 fix: fish audio wrong validate credentials interface (#11019) 1 year ago
gitee_ai 40fc6f529e fix: gitee ai wrong default model, and better para (#11168) 1 year ago
google 1ce51e57ab feat: add gemini exp 1206 (#11444) 1 year ago
gpustack 8aae235a71 fix: int None will cause error for context size (#11055) 1 year ago
groq 80c52e0ea4 feat: Add llama-3.3 models for Groq (#11533) 1 year ago
huggingface_hub b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 1 year ago
huggingface_tei 096c0ad564 feat: Add support for TEI API key authentication (#11006) 1 year ago
hunyuan 92a3898540 fix: resolve the incorrect model name of hunyuan-standard-256k (#10052) 1 year ago
jina 8aae235a71 fix: int None will cause error for context size (#11055) 1 year ago
leptonai 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 year ago
localai 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 1 year ago
minimax 32f8439143 fix: add the missing abab6.5t-chat model of Minimax (#11484) 1 year ago
mistralai 42d986b96d [Pixtral] Add new model ; add vision (#11231) 1 year ago
mixedbread b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 1 year ago
moonshot 643a90c48d fix: use `removeprefix()` instead of `lstrip()` to remove the `data:` prefix (#11272) 1 year ago
nomic b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 1 year ago
novita 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 year ago
nvidia b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 1 year ago
nvidia_nim 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 year ago
oci b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 1 year ago
ollama 7e1184c071 feat: support json_schema for ollama models (#11449) 1 year ago
openai aa135a3780 Add TTS to OpenAI_API_Compatible (#11071) 1 year ago
openai_api_compatible 643a90c48d fix: use `removeprefix()` instead of `lstrip()` to remove the `data:` prefix (#11272) 1 year ago
openllm 0067b16d1e fix: refactor all 'or []' and 'or {}' logic to make code more clear (#10883) 1 year ago
openrouter 4d6b45427c Support streaming output for OpenAI o1-preview and o1-mini (#10890) 1 year ago
perfxcloud 8aae235a71 fix: int None will cause error for context size (#11055) 1 year ago
replicate b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 1 year ago
sagemaker 51db59622c chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425) 1 year ago
siliconflow ec00b25793 feat: add siliconflow qwq and llama3.3 model (#11492) 1 year ago
spark d0e0111f88 fix:Spark's large language model token calculation error #7911 (#8755) 1 year ago
stepfun 643a90c48d fix: use `removeprefix()` instead of `lstrip()` to remove the `data:` prefix (#11272) 1 year ago
tencent 40fb4d16ef chore: refurbish Python code by applying refurb linter rules (#8296) 1 year ago
togetherai 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 year ago
tongyi fbc4ca980c fix: Remove duplicate 'response_format' parameter from model YAML files (#11531) 1 year ago
triton_inference_server 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 1 year ago
upstage b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 1 year ago
vertex_ai 4d7cfd0de5 Fix model provider of vertex ai (#11437) 1 year ago
vessl_ai aa895cfa9b fix: [VESSL-AI] edit some words in vessl_ai.yaml (#10417) 1 year ago
volcengine_maas e79eac688a chore(lint): sort __all__ definitions (#11243) 1 year ago
voyage 8aae235a71 fix: int None will cause error for context size (#11055) 1 year ago
wenxin e39e776d03 fix: better wenxin rerank handler, close #11252 (#11283) 1 year ago
x 1065917872 Add grok-vision-beta to xAI + Update grok-beta Features (#11004) 1 year ago
xinference 03ba4bc760 fix error with xinference tool calling with qwen2-instruct and add timeout retry setttings for xinference (#11012) 1 year ago
yi e0846792d2 feat: add yi custom llm intergration (#9482) 1 year ago
zhinao 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 year ago
zhipuai 142b4fd699 feat: add zhipu glm_4v_flash (#11440) 1 year ago
__init__.py d069c668f8 Model Runtime (#1858) 2 years ago
_position.yaml fb49413a41 feat: add voyage ai as a new model provider (#8747) 1 year ago
model_provider_factory.py 4e7b6aec3a feat: support pinning, including, and excluding for model providers and tools (#7419) 1 year ago