ModelCloud.ai · GitHub

ModelCloud.ai

Our mission is to give allow everyone, including bots, unlimited and free access to llm/ai models.

Verified
We've verified that the organization ModelCloud controls the domain:
- modelcloud.ai
Learn more about verified organizations

Overview
Repositories
Projects
Packages
People

Pinned Loading

GPTQModel GPTQModel Public

LLM model quantization (compression) toolkit with HW acceleration support for Nvidia, AMD, Intel GPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.

Python 1.2k 184
Device-SMI Device-SMI Public

Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separate tools such as nvidia-smi or /proc/cpuinfo and parsing it y…

Python 15 1

Repositories

Type Language Sort

GPTQModel Public
LLM model quantization (compression) toolkit with HW acceleration support for Nvidia, AMD, Intel GPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.

ModelCloud/GPTQModel’s past year of commit activity

Python 1,156 184 43 1 Updated May 25, 2026
Defuser Public
Model defuser helper for HF Transformers

ModelCloud/Defuser’s past year of commit activity

Python 2 Apache-2.0 0 0 0 Updated May 21, 2026
PyPcre Public

ModelCloud/PyPcre’s past year of commit activity

Python 4 Apache-2.0 2 0 0 Updated May 6, 2026
Evalution Public
Evalution: evolve your LLMs with better evals.

ModelCloud/Evalution’s past year of commit activity

Python 15 Apache-2.0 1 0 1 Updated Apr 28, 2026
Device-SMI Public
Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separate tools such as nvidia-smi or /proc/cpuinfo and parsing it yourself.

ModelCloud/Device-SMI’s past year of commit activity

Python 15 Apache-2.0 1 0 0 Updated Apr 23, 2026
LogBar Public
A unified Logger and ProgressBar util with zero dependencies.

ModelCloud/LogBar’s past year of commit activity

Python 9 Apache-2.0 0 0 0 Updated Apr 22, 2026
Tokenicer Public
A (nicer) tokenizer you want to use for model inference and training: with all known peventable gotchas normalized or auto-fixed.

ModelCloud/Tokenicer’s past year of commit activity

Python 11 Apache-2.0 4 0 0 Updated Apr 22, 2026
MemLord Public

ModelCloud/MemLord’s past year of commit activity

Python 1 Apache-2.0 0 0 1 Updated Apr 16, 2026
vllm Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs

ModelCloud/vllm’s past year of commit activity

Python 1 Apache-2.0 17,359 0 0 Updated Mar 26, 2026
sglang Public Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.

ModelCloud/sglang’s past year of commit activity

Python 0 Apache-2.0 6,138 0 0 Updated Mar 26, 2026

View all repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…

Terms
Privacy
Security
Status
Community
Docs
Contact
Manage cookies
Do not share my personal information

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ModelCloud.ai

Pinned Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ModelCloud.ai

Pinned Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!

Footer

Footer navigation