Skip to content
@basetenlabs

Baseten

Machine learning infrastructure for developers

Welcome to Baseten

Baseten is an AI infrastructure platform. We combine applied performance research, distributed multi-cloud infrastructure, and developer tooling to run models of all modalities in production.

Get started:

  • Deploy an open-source model in two clicks from the model library.
  • Read our docs to package and serve a fine-tuned or custom model.

Pinned Loading

  1. truss truss Public

    The simplest way to serve AI/ML models in production

    Python 1.1k 96

  2. truss-examples truss-examples Public

    Examples of models deployable with Truss

    Python 220 58

Repositories

Showing 10 of 89 repositories
  • genai-bench Public Forked from sgl-project/genai-bench

    Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

    basetenlabs/genai-bench’s past year of commit activity
    Python 2 MIT 50 0 6 Updated Mar 31, 2026
  • truss-examples Public

    Examples of models deployable with Truss

    basetenlabs/truss-examples’s past year of commit activity
    Python 220 MIT 58 15 64 Updated Mar 31, 2026
  • pyannote-audio Public Forked from pyannote/pyannote-audio

    Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

    basetenlabs/pyannote-audio’s past year of commit activity
    Jupyter Notebook 0 MIT 1,046 0 3 Updated Mar 30, 2026
  • ml-cookbook Public

    Ready-to-use ML training recipes to help you build and deploy models on Baseten.

    basetenlabs/ml-cookbook’s past year of commit activity
    Python 49 MIT 4 0 17 Updated Mar 30, 2026
  • basetenlabs/langchain-baseten’s past year of commit activity
    Python 0 MIT 1 0 6 Updated Mar 30, 2026
  • prime-rl Public Forked from PrimeIntellect-ai/prime-rl

    Async RL Training at Scale

    basetenlabs/prime-rl’s past year of commit activity
    Python 1 Apache-2.0 246 0 12 Updated Mar 30, 2026
  • truss Public

    The simplest way to serve AI/ML models in production

    basetenlabs/truss’s past year of commit activity
    Python 1,131 MIT 96 9 58 Updated Mar 30, 2026
  • kingkong Public
    basetenlabs/kingkong’s past year of commit activity
    Python 1 BSD-3-Clause 0 0 7 Updated Mar 30, 2026
  • Model-Optimizer Public Forked from NVIDIA/Model-Optimizer

    A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

    basetenlabs/Model-Optimizer’s past year of commit activity
    Python 1 Apache-2.0 321 0 3 Updated Mar 30, 2026
  • openclaw-baseten Public Forked from openclaw/openclaw

    Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

    basetenlabs/openclaw-baseten’s past year of commit activity
    TypeScript 1 MIT 67,831 0 11 Updated Mar 28, 2026

Most used topics

Loading…