Forem

# llm

May 22

I built a version manager for llama.cpp using nothing but vibe coding.

#showdev #ai #llm #vibecoding

3 min read

WonderLab

May 24

Agent Series (3): Plan-and-Solve — Think First, Then Act

#ai #agents #langchain #llm

10 min read

May 23

Reasoning happens before the response

#ai #mcp #llm #agents

5 min read

Cartone

May 23

When your AI CEO Lies about the Numbers

#discuss #ai #llm #learning

5 min read

May 23

From Tokens to Attention: My First Real Mental Model of LLMs

#ai #beginners #llm #machinelearning

5 min read

Rost

May 24

Qwen 3.6 27B and 35B MTP vs Standard on 16GB GPU

#selfhosting #llm #ai #llamacpp

8 min read

WonderLab

May 24

One Open Source Project per Day #74: ai-engineering-from-scratch - Build AI Full-stack Skills from Ground Up

#ai #opensource #llm #learning

2 min read

Ivan BUSH

May 23

What I learned building memory for Claude Code — measured against the popular alternative

#ai #claude #opensource #llm

8 min read

Lingdas1

May 23

GGUF & Modelfile: The Power User's Guide to Local LLMs

#gguf #llm #opensource #tutorial

5 min read

Lingdas1

May 23

Hardware Guide: What Do You Actually Need to Run Local LLMs?

#hardware #llm #opensource #guide

7 min read

Andrew Kew

May 23

NVIDIA's Nemotron Diffusion: One Model, Three Generation Modes, 6× Faster

#ai #machinelearning #llm #nvidia

3 min read

May 23

LangChain JsonOutputParser: Fix Malformed JSON from LLMs

#python #langchain #llm #json

2 min read

Vainamoinen | Pulsed Media

May 23

Why Claude Code Sessions Diverge: A Mechanism Catalog

#ai #llm #agents #devops

3 min read

May 23

Building a cost-efficient LLM caching layer in Python

#python #ai #llm #performance

5 min read

soy

May 23

Gemma4 Apex GGUF, Ollama Context Optimization, & Llama3 Benchmarks

#ai #llm #selfhosted

3 min read
