Github Projects

vLLM

2024-04-28

vLLM is a high-throughput and memory-efficient inference and serving engine for large language models, offering seamless integration with popular models and optimized performance.

MaxKB

2024-04-27

MaxKB is an open-source knowledge base Q&A system powered by large language models and RAG, designed for intelligent customer service, corporate knowledge bases, and academic research.

Open WebUI

2024-04-26

Open WebUI is a user-friendly, extensible self-hosted web interface for LLMs, supporting Ollama, OpenAI-compatible APIs, and advanced features like RAG, voice/video calls, and multi-model conversations.

LLaMA-Factory

2024-04-23

LLaMA-Factory is a unified framework for efficient fine-tuning of 100+ large language models, supporting various training approaches and optimizations.

Dify

2024-04-20

Dify is an open-source platform for building and deploying LLM applications with workflows, RAG, agents, and observability.

llm.c

2024-04-18

A lightweight implementation of Large Language Models (LLMs) in pure C/CUDA, focusing on pretraining and reproducibility of GPT-2 and GPT-3 models.

Llama3

2024-04-14

Llama3 is Meta's latest large language model, offering pre-trained and instruction-tuned models ranging from 8B to 70B parameters for responsible experimentation and innovation.

RAGFlow

2024-04-13

RAGFlow is an open-source RAG engine that combines deep document understanding with LLMs for accurate, citation-backed question answering from complex data formats.

Plandex

2024-04-07

Plandex is a terminal-based AI tool that helps plan and execute complex coding tasks across multiple files with a large context window.

LLM Answer Engine

2024-04-06

A sophisticated answer engine leveraging Groq, Mistral AI's Mixtral, Langchain.JS, and more to provide comprehensive responses including sources, answers, images, and follow-up questions.

Mojo

2024-04-05

Mojo is part of the Modular platform, offering an integrated suite of AI tools and libraries to streamline deployment workflows and accelerate innovation.

VoiceCraft

2024-04-03

VoiceCraft is a token infilling neural codec language model for speech editing and zero-shot TTS on in-the-wild data.

MoneyPrinterTurbo

2024-04-01

An AI-powered tool for generating high-quality videos automatically, supporting multiple models and APIs for seamless content creation.

Developer Roadmap

2024-03-30

An interactive, community-driven collection of roadmaps and best practices to guide developers through various tech career paths and skills.

Open Interpreter

2024-03-28

Open Interpreter allows LLMs to run Python, JavaScript, Shell, and more locally through a ChatGPT-like terminal interface.

OpenHands

2024-03-27

OpenHands is an open-source platform enabling AI-powered software development agents to perform tasks like coding, running commands, and browsing the web.

GPT Pilot

2024-03-23

GPT Pilot is an AI developer companion that writes, debugs, and collaborates on full-feature development under developer oversight.

Full Stack FastAPI Template

2024-03-22

A production-ready full-stack web template with FastAPI backend, React frontend, and Docker deployment.

Open-Sora

2024-03-20

Open-Sora is an open-source initiative dedicated to efficiently producing high-quality video, making advanced video generation techniques accessible to everyone.

Grok-1

2024-03-17

Grok-1 is an open-weights large language model with 314B parameters, featuring a Mixture of 8 Experts architecture and advanced capabilities like activation sharding and 8-bit quantization.

Screenshot-to-Code

2024-03-04

A tool that converts screenshots, mockups, and Figma designs into functional code using AI models like Claude Sonnet 3.7 and GPT-4o.