LLMs-Lab

Name: LLMs-Lab
Author: Eric-LLMs

Eric-LLMs August 18, 2024

6 copies 0 downloads

Full-stack LLM Engineering Lab. Features: Autonomous Agents (ReAct/AutoGPT) | Fine-Tuning Llama/Mistral (SFT/DPO) | Large Model Deployment (DeepSeek 671B / 2.5-bit) | Advanced RAG (Hybrid Search) | Function Calling (Stream/Text-to-SQL/External APIs) | Frameworks (LangChain, Semantic Kernel, OpenAI) | Daily SOTA Paper Tracking. From theory to 0-to-1

LLMs-Lab

9. 📂 DeepSeek

Focused on Inference Optimization and Low-Bit Quantization strategies for massive-scale MoE models (600B+ parameters).

DeepSeek-R1 (671B) 2.51-bit Extreme Quantization Deployment:
- Deployed the 2.51-bit quantized version (via Unsloth) of the 671B MoE model, achieving an ~80% reduction in memory footprint (from 720GB to ~212GB).
- Analyzed official and community benchmarks for 1.58-bit vs 2.51-bit configurations, ultimately selecting the 2.51-bit build (Q2_K_XL) to ensure superior reasoning stability on the H20 GPU cluster.
- 📄 View Hands-on Deployment Log & Benchmarks (PDF)

8. 📂 Fine-Tuning

This directory bridges the gap between theoretical architecture analysis and practical, memory-efficient fine-tuning of state-of-the-art open-source models. It covers the full lifecycle from pre-training understanding to post-training alignment.

Key Modules

Transformer Source Code Analysis: A deep dive into the vanilla Transformer architecture, focusing on a line-by-line implementation analysis of Self-Attention mechanisms, Multi-Head Attention, and Layer Normalization to understand the foundational building blocks.
Llama Series: QLoRA & Quantization: Implementation of QLoRA (Quantized Low-Rank Adapters) to fine-tune L

Comments

More Agents

View all

agentic-ai

Klaatcode

Open-source AI coding agent for the terminal. Claude Code-grade accuracy with smart model routing — uses the right AI model for each task, cutting costs 10x. Supports Claude, GPT, Gemini, DeepSeek & more.

KlaatAI

139

agent

Agentmaker

A general-purpose Python framework for building LLM agents and multi-agent systems. "Four lines of code, an agent with memory."

xinhuangcs

ai-api

Api Model Playground Cookbook

Ultimate LLM API Integration Cookbook 2026 for Cursor & AI Agents

09omerdgn-droid

150

agent-framework

Agent Ecologies

Ultimate Multi-Agent OS for Autonomous AI NPCs 2026

israriqbal

153

Private Agent

PrivateAgent is an open-source Android automation agent built with Flutter. It utilizes the DeepSeek API and native Android Accessibility Services to interpret screen layouts and execute multi-step tasks across any installed application via natural language commands.

orailnoor

123

Loom Novel

把一队分工 Agent 织成一条写小说的流水线,做成桌面客户端;写作指纹让它越写越像你(BYO DeepSeek key,纯本地)。

WadeZhao23

184

LLMs-Lab

LLMs-Lab

9. 📂 DeepSeek

8. 📂 Fine-Tuning

Key Modules

Tags

Comments

More Agents

Klaatcode

Agentmaker

Api Model Playground Cookbook

Agent Ecologies

Private Agent

Loom Novel

Ready-made automations for this