Tutorials & Deep Dives

PyQt5 desktop GUI for fine-tuning, evaluating, and deploying LLMs using torchtune — no command-line required. Full visual workflow.

torchtuneLoRAPyQt5

LLM Architectures & Deployment

ArchitecturesTransformersDeployment

Comprehensive guides for working with Large Language Models — architectures, training pipelines, and deployment strategies explained in depth.

LoRA: From Scratch Implementation

KV CacheCompressionInference

Low-Rank Adaptation of Large Language Models — parameter-efficient fine-tuning implemented from scratch with detailed code walkthrough.

LoRAPEFTFrom Scratch

Efficient Inference & Compression

KV cache optimization, model quantization, and LLM compression research

KV Cache & LLM Compression Papers

150+ Papers

Curated collection of 150+ research papers on KV Cache Management, KV Cache Compression, and LLM Compression for efficient inference.

Awesome Model Quantization

Papers

Curated list of papers, docs, and code about model quantization — aimed at providing comprehensive info for quantization research.

QuantizationQATGPTQ

YOLOX Quantization & Distillation

Hands-on tutorial combining YOLOX with Quantization Aware Training and Knowledge Distillation for efficient real-time object detection.

YOLOXQATKD

ONNX: Complete Tutorial

ONNXDeploymentOptimization

A comprehensive, book-style tutorial covering everything about ONNX — from fundamentals to production deployment and optimization.

Diffusion Models

Image generation, text diffusion, and denoising from theory to implementation

Diffusion Language Models

Diffusion LMText GenPyTorch

Text generation using diffusion-style denoising — iteratively refining noisy sequences into coherent text, an alternative to autoregressive LLMs.

Reparameterization Denoising

DenoisingDiffusionPyTorch

Implementation of diffusion-based denoising with reparameterization — organized and shared with detailed code walkthrough.

Diffusion Models: Survey & Taxonomy

Papers

Comprehensive survey and taxonomy of diffusion model papers — organized by architecture, application, and training methodology.

SurveyTaxonomyResearch

Attention & Transformers

Efficient attention mechanisms, linear attention, and transformer architecture surveys

Attention Mechanisms: 3 Surveys

AttentionTransformersEfficiency

Three in-depth surveys covering efficient transformer architectures, attention variants, and optimization techniques for scalable inference.

REGLA: Gated Linear Attention

Linear AttentionEfficientResearch

Refining Gated Linear Attention — efficient alternative to softmax attention for scalable sequence modeling with linear complexity.

From PCA to VAE

Understanding dimensionality reduction from classical PCA through autoencoders to Variational Autoencoders — theory and implementation.

PCAVAERepresentation

Computer Vision

Image enhancement, object detection, segmentation, and low-level vision

Low-Level Vision: Complete Guide

DenoisingSuper-ResEnhancement

Super-Resolution, denoising, deblurring, dehazing, low-light enhancement, artifact removal — end-to-end models with PSNR/SSIM benchmarks.

FlashDet: End-to-End Detection

Complete training system with PyQt5 desktop app — LoRA/QLoRA fine-tuning, knowledge distillation, ONNX export, INT8 quantization. 100+ FPS.

DetectionLoRAKD

YOLOv8 PyTorch Implementation

Modular PyTorch implementation of YOLOv8 for object detection with clear architecture, training pipelines, and detailed documentation.

YOLOv8PyTorchDetection

Feature Detection from Scratch

FeaturesClassical CVFrom Scratch

From-scratch implementations of classic feature detection and description algorithms — SIFT, SURF, ORB, Harris, and more.

Image Object Removal & Inpainting

Remove objects from photos including shadows and reflections using generative inpainting — end-to-end diffusion-based restoration.

InpaintingGenAIDiffusion

Bayer Low-Light Enhancement

Low-light image enhancement directly on Bayer pattern data — RAW image processing with deep learning for mobile camera pipelines.

Low-LightRAWMobile

ML Systems & Production

System design, data drift, monitoring, and production ML architecture

ML System Design Guide