Popular repositories

  1. lorax Public

    Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

    Python 3.7k 311

  2. llm_distillation_playbook Public

    Best practices for distilling large language models.

    Jupyter Notebook 617 55

  3. lora_bakeoff Public

    Python 20 2

  4. json-mode-benchmark Public

    Jupyter Notebook 7 1

  5. neuropod Public

    Forked from uber/neuropod

    A uniform interface to run deep learning models from multiple frameworks

    C++ 3 2

  6. punica Public

    Forked from punica-ai/punica

    Serving multiple LoRA finetuned LLM as one

    Cuda 2 4

Repositories

Showing 10 of 20 repositories
  • litellm Public Forked from BerriAI/litellm

    Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, VLLM, NVIDIA NIM]

    Python 0 6,953 0 0 Updated Mar 31, 2026
  • rubrik-litellm-integration Public

    Integrating LiteLLM into RAC

    Python 0 0 0 0 Updated Mar 20, 2026
  • seldon-core Public archive Forked from SeldonIO/seldon-core

    An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models

    HTML 0 Apache-2.0 872 0 0 Updated Sep 16, 2025
  • lorax Public

    Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

    Python 3,741 Apache-2.0 311 150 (16 issues need help) 28 Updated May 21, 2025
  • lora_bakeoff Public
    Python 20 2 2 0 Updated Sep 5, 2024
  • json-mode-benchmark Public
    Jupyter Notebook 7 1 0 0 Updated Mar 3, 2024
  • llm_distillation_playbook Public

    Best practices for distilling large language models.

    Jupyter Notebook 617 55 1 0 Updated Feb 1, 2024
  • huggingface_hub Public Forked from huggingface/huggingface_hub

    The official Python client for the Huggingface Hub.

    Python 0 Apache-2.0 1,003 0 0 Updated Dec 18, 2023
  • volcano Public archive Forked from volcano-sh/volcano

    A Cloud Native Batch System (Project under CNCF)

    Go 0 Apache-2.0 1,343 0 1 Updated Dec 4, 2023
  • punica Public Forked from punica-ai/punica

    Serving multiple LoRA finetuned LLM as one

    Cuda 2 62 0 0 Updated Nov 24, 2023

People

This organization has no public members. You must be a member to see who’s a part of this organization.
