SiliconFlow, Inc dc650c5368 Fixes #12414: Add cheaper model and long context model for Qwen2.5-72B-Instruct from siliconflow (#12415) 1 year ago
__base 6f5a8a33d9 refactor: replace gevent threadpool with ProcessPoolExecutor in GPT2Tokenizer (#12316) 1 year ago
anthropic 2681bafb76 fix: handle document fetching from URL in Anthropic LLM model, solving base64 decoding error (#11858) 1 year ago
azure_ai_studio 51db59622c chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425) 1 year ago
azure_openai c98d91e44d fix: o1 model error, use max_completion_tokens instead of max_tokens. (#12037) 1 year ago
baichuan daccb10d8c fix: volcengine_maas and baichuan message error (#11625) 1 year ago
bedrock 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
chatglm 40fb4d16ef chore: refurbish Python code by applying refurb linter rules (#8296) 1 year ago
cohere 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
deepseek 79801f5c30 fix: deepseek reports an error when using Response Format #11677 (#11678) 1 year ago
fireworks 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
fishaudio 448a19bf54 fix: fish audio wrong validate credentials interface (#11019) 1 year ago
gitee_ai 6df17a334c fix: Update the API call address for the text_embedding model (#12342) 1 year ago
google 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
gpustack 2bb521b135 Support TTS and Speech2Text for Model Provider GPUStack (#12381) 1 year ago
groq d7c0bc8c23 feat: Add response format support for openai compat models (#12240) 1 year ago
huggingface_hub 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
huggingface_tei 6a0ff3686c fix: fix typo (#12034) 1 year ago
hunyuan baeddd4d15 feat:Add support for stop parameter in hunyuan model #12313 (#12315) 1 year ago
jina 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
leptonai 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 year ago
localai 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 1 year ago
minimax 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
mistralai 42d986b96d [Pixtral] Add new model ; add vision (#11231) 1 year ago
mixedbread b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 1 year ago
moonshot 3c2e30f348 fix: #12143 support streaming mode content start with "data:" (#12171) 1 year ago
nomic 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
novita 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 year ago
nvidia b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 1 year ago
nvidia_nim 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 year ago
oci 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
ollama 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
openai d7c0bc8c23 feat: Add response format support for openai compat models (#12240) 1 year ago
openai_api_compatible 3c2e30f348 fix: #12143 support streaming mode content start with "data:" (#12171) 1 year ago
openllm 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
openrouter 4d6b45427c Support streaming output for OpenAI o1-preview and o1-mini (#10890) 1 year ago
perfxcloud 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
replicate 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
sagemaker 147d578922 [Fix] revert sagemaker llm to support model hub (#12378) 1 year ago
siliconflow dc650c5368 Fixes #12414: Add cheaper model and long context model for Qwen2.5-72B-Instruct from siliconflow (#12415) 1 year ago
spark 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
stepfun 3c2e30f348 fix: #12143 support streaming mode content start with "data:" (#12171) 1 year ago
tencent 40fb4d16ef chore: refurbish Python code by applying refurb linter rules (#8296) 1 year ago
togetherai 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
tongyi 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
triton_inference_server 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 1 year ago
upstage 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
vertex_ai 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
vessl_ai 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
volcengine_maas 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
voyage 8aae235a71 fix: int None will cause error for context size (#11055) 1 year ago
wenxin 2a909e634b feat: support Ernie-lite-pro-128k (#12161) 1 year ago
x cf0ff88120 feat: add grok-2-1212 and grok-2-vision-1212 (#11672) 1 year ago
xinference 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
yi 56e15d09a9 feat: mypy for all type check (#10921) 1 year ago
zhinao 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 year ago
zhipuai 7c1961e618 feat: Add response format support to GLM-4 (#12252) 1 year ago
__init__.py d069c668f8 Model Runtime (#1858) 2 years ago
_position.yaml fb49413a41 feat: add voyage ai as a new model provider (#8747) 1 year ago
model_provider_factory.py 4e7b6aec3a feat: support pinning, including, and excluding for model providers and tools (#7419) 1 year ago