Giovanny Gutiérrez d7c0bc8c23 feat: Add response format support for openai compat models (#12240) 1 anno fa
..
__base 6a85960605 feat: implement asynchronous token counting in GPT2Tokenizer (#12239) 1 anno fa
anthropic 2681bafb76 fix: handle document fetching from URL in Anthropic LLM model, solving base64 decoding error (#11858) 1 anno fa
azure_ai_studio 51db59622c chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425) 1 anno fa
azure_openai c98d91e44d fix: o1 model error, use max_completion_tokens instead of max_tokens. (#12037) 1 anno fa
baichuan daccb10d8c fix: volcengine_maas and baichuan message error (#11625) 1 anno fa
bedrock 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
chatglm 40fb4d16ef chore: refurbish Python code by applying refurb linter rules (#8296) 1 anno fa
cohere 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
deepseek 79801f5c30 fix: deepseek reports an error when using Response Format #11677 (#11678) 1 anno fa
fireworks 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
fishaudio 448a19bf54 fix: fish audio wrong validate credentials interface (#11019) 1 anno fa
gitee_ai 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
google 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
gpustack 8aae235a71 fix: int None will cause error for context size (#11055) 1 anno fa
groq d7c0bc8c23 feat: Add response format support for openai compat models (#12240) 1 anno fa
huggingface_hub 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
huggingface_tei 6a0ff3686c fix: fix typo (#12034) 1 anno fa
hunyuan 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
jina 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
leptonai 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 anno fa
localai 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 1 anno fa
minimax 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
mistralai 42d986b96d [Pixtral] Add new model ; add vision (#11231) 1 anno fa
mixedbread b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 1 anno fa
moonshot 643a90c48d fix: use `removeprefix()` instead of `lstrip()` to remove the `data:` prefix (#11272) 1 anno fa
nomic 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
novita 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 anno fa
nvidia b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 1 anno fa
nvidia_nim 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 anno fa
oci 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
ollama 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
openai d7c0bc8c23 feat: Add response format support for openai compat models (#12240) 1 anno fa
openai_api_compatible d7c0bc8c23 feat: Add response format support for openai compat models (#12240) 1 anno fa
openllm 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
openrouter 4d6b45427c Support streaming output for OpenAI o1-preview and o1-mini (#10890) 1 anno fa
perfxcloud 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
replicate 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
sagemaker 9954ddb780 [Fix] modify sagemaker llm (#12274) 1 anno fa
siliconflow 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
spark 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
stepfun 643a90c48d fix: use `removeprefix()` instead of `lstrip()` to remove the `data:` prefix (#11272) 1 anno fa
tencent 40fb4d16ef chore: refurbish Python code by applying refurb linter rules (#8296) 1 anno fa
togetherai 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
tongyi 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
triton_inference_server 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 1 anno fa
upstage 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
vertex_ai 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
vessl_ai 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
volcengine_maas 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
voyage 8aae235a71 fix: int None will cause error for context size (#11055) 1 anno fa
wenxin 2a909e634b feat: support Ernie-lite-pro-128k (#12161) 1 anno fa
x cf0ff88120 feat: add grok-2-1212 and grok-2-vision-1212 (#11672) 1 anno fa
xinference 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
yi 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
zhinao 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 anno fa
zhipuai 56e15d09a9 feat: mypy for all type check (#10921) 1 anno fa
__init__.py d069c668f8 Model Runtime (#1858) 2 anni fa
_position.yaml fb49413a41 feat: add voyage ai as a new model provider (#8747) 1 anno fa
model_provider_factory.py 4e7b6aec3a feat: support pinning, including, and excluding for model providers and tools (#7419) 1 anno fa