llm tag | Devtalk

Week Month Year 3 Years Over 3 Years

Latest #llm Threads

AI>In The News

LLM Inference in Production

A practical handbook for engineers building, optimizing, scaling and operating LLM inference systems in production.

bentoml.com

#production #llm #introduction

0 8 0

2025-07-13 01:22:58 UTC

New

Backend>Blogs/Talks

Thinking Elixir Podcast 260: Cheaper testing with AI?

Thinking Elixir 260: Cheaper testing with AI? Episode 260 of Thinking Elixir. News includes LiveDebugger v0.3.0 with enhanced debugging ...

youtube.com

#ai /elixir #podcasts #testing #debugging #llm #oban

0 2 0

2025-07-08 12:22:34 UTC

New

AI>In The News

Optimizing Tool Selection for LLM Workflows: Differentiable Programming with PyTorch and DSPy

How local, learnable routers can reduce token overhead, lower costs, and bring structure back to agentic workflows.

viksit.substack.com

#programming #llm

0 1 0

2025-07-07 14:16:12 UTC

New

AI>Libraries/Tools

Open WebUI: self-hosted AI platform designed to operate entirely offline

I just came across this tool. It looks interesting. Open WebUI is an extensible, feature-rich, and user-friendly self-hosted AI platfor...

#ai #llm #ollama #self-hosting

0 1 0

2025-07-07 05:23:54 UTC

New

Linux>Official News

openSUSE: SUSE Refines, Releases Open-Source LLM to Fuel Community Collaboration

A new openSUSE blog post/announcement has been posted! Get the full details here: SUSE Refines, Releases Open-Source LLM to Fuel Commun...

news.opensuse.org

#community #suse /opensuse #official-news #llm

0 79 0

2025-06-24 12:29:03 UTC

New

AI>In The News

AI system development: LLM → RAG → AI Workflow → AI Agent | CodeLink

Confused by LLMs, RAG, & AI Agents? We break down the spectrum of AI system design with a familiar resume-screening example to show t...

codelink.io

#development #llm #rag

0 121 0

2025-06-19 15:12:02 UTC

New

AI>In The News

LLM agents flunk CRM and confidentiality tasks

: 6-in-10 success rate for single-step tasks

theregister.com

#llm

0 76 0

2025-06-17 14:27:42 UTC

New

AI>In The News

Reverse Engineering Cursor's LLM Client · TensorZero

Reverse Engineering Cursor’s LLM Client

tensorzero.com

#llm #cursor

0 194 0

2025-06-07 15:54:20 UTC

New

Backend>Blogs/Talks

LLMS & Elixir: Windfall or Deathblow?

How the Elixir community can survive — and thrive — in an age of LLMs.

open.substack.com

#llm

1 271 0

2025-06-01 16:31:07 UTC

New

AI>In The News

LLM Codegen go Brrr – Parallelization with Git Worktrees and Tmux | Category | Trieve

If you’re underwhelmed with AI coding agents or simply want to get more out of them, give parallelization a try. After seeing the results...

skeptrune.com

#git

0 128 0

2025-05-29 04:09:26 UTC

New

This Week's Trending

Backend>Blogs/Talks

Thinking Elixir Podcast 260: Cheaper testing with AI?

Thinking Elixir 260: Cheaper testing with AI? Episode 260 of Thinking Elixir. News includes LiveDebugger v0.3.0 with enhanced debugging ...

youtube.com

#ai /elixir #podcasts #testing #debugging #llm #oban

0 2 0

2025-07-08 12:22:34 UTC

New

AI>In The News

LLM Inference in Production

A practical handbook for engineers building, optimizing, scaling and operating LLM inference systems in production.

bentoml.com

#production #llm #introduction

0 8 0

2025-07-13 01:22:58 UTC

New

This Month's Trending

AI>In The News

AI system development: LLM → RAG → AI Workflow → AI Agent | CodeLink

Confused by LLMs, RAG, & AI Agents? We break down the spectrum of AI system design with a familiar resume-screening example to show t...

codelink.io

#development #llm #rag

0 121 0

2025-06-19 15:12:02 UTC

New

AI>Libraries/Tools

Open WebUI: self-hosted AI platform designed to operate entirely offline

I just came across this tool. It looks interesting. Open WebUI is an extensible, feature-rich, and user-friendly self-hosted AI platfor...

#ai #llm #ollama #self-hosting

0 1 0

2025-07-07 05:23:54 UTC

New

Linux>Official News

openSUSE: SUSE Refines, Releases Open-Source LLM to Fuel Community Collaboration

A new openSUSE blog post/announcement has been posted! Get the full details here: SUSE Refines, Releases Open-Source LLM to Fuel Commun...

news.opensuse.org

#community #suse /opensuse #official-news #llm

0 79 0

2025-06-24 12:29:03 UTC

New

AI>In The News

LLM agents flunk CRM and confidentiality tasks

: 6-in-10 success rate for single-step tasks

theregister.com

#llm

0 76 0

2025-06-17 14:27:42 UTC

New

AI>In The News

Optimizing Tool Selection for LLM Workflows: Differentiable Programming with PyTorch and DSPy

How local, learnable routers can reduce token overhead, lower costs, and bring structure back to agentic workflows.

viksit.substack.com

#programming #llm

0 1 0

2025-07-07 14:16:12 UTC

New

This Year's Trending

AI>Chat

DeepSeek - the free, open source “ChatGPT killer”

Loads of news stories about DeepSeek here in the last few days, no surprise as it’s been making headlines across the world! Currently a h...

#ai #chatgpt #openai #llm /deepseek

3 205 6

2025-02-03 22:26:29 UTC

New

AI>In The News

As an Experienced LLM User, I Actually Don't Use Generative LLMs Often

But for what I do use LLMs for, it’s invaluable.

minimaxir.com

#llm

1 332 1

2025-05-06 08:12:35 UTC

New

Backend>Blogs/Talks

LLMS & Elixir: Windfall or Deathblow?

How the Elixir community can survive — and thrive — in an age of LLMs.

open.substack.com

#llm

1 271 0

2025-06-01 16:31:07 UTC

New

AI>In The News

Reverse Engineering Cursor's LLM Client · TensorZero

Reverse Engineering Cursor’s LLM Client

tensorzero.com

#llm #cursor

0 194 0

2025-06-07 15:54:20 UTC

New

General Dev>In The News

Use Prolog to improve LLM's reasoning

Use Prolog to improve LLM’s reasoning. On one side, LLMs show unseen capabilities in reasoning, but on the other - reasoning in LLMs is ...

shchegrikovich.substack.com

/prolog #llm

0 194 0

2024-10-18 03:22:58 UTC

New

Backend>Official News

Building LLM-powered applications in Go

A new Go blog post/announcement has been posted! Get the full details here: Building LLM-powered applications in Go - The Go Programmin...

go.dev

/go #official-news #llm

0 109 0

2024-09-12 22:17:34 UTC

New

AI>In The News

Build Real-Time Knowledge Graph For Documents with LLM | CocoIndex

CocoIndex now supports knowledge graph with incremental processing. Build live knowledge for agents is super easy with CocoIndex!

cocoindex.io

#knowledge

0 161 0

2025-05-14 03:27:44 UTC

New

General Dev>In The News

Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs

In this technical report, we tackle the challenges of training large-scale Mixture of Experts (MoE) models, focusing on overcoming cost i...

arxiv.org

#flop #llm

0 162 0

2025-03-28 21:38:53 UTC

New

General Dev>In The News

LLM bots + Next.js image optimization = recipe for bankruptcy (post-mortem) | Metacast Blog

A misconfiguration that might have cost us $7,000

metacast.app

#blog #bots #llm

0 160 0

2025-04-15 14:52:01 UTC

New

AI>In The News

LLM function calls don't scale; code orchestration is simpler, more effective

One common practice for working with MCP tools calls is to put the outputs from a tool back into the LLM as a message, and ask the LLM fo...

jngiam.bearblog.dev

#code

0 156 0

2025-05-22 03:37:38 UTC

New

AI>In The News

LLM Codegen go Brrr – Parallelization with Git Worktrees and Tmux | Category | Trieve

If you’re underwhelmed with AI coding agents or simply want to get more out of them, give parallelization a try. After seeing the results...

skeptrune.com

#git

0 128 0

2025-05-29 04:09:26 UTC

New

General Dev>In The News

My LLM codegen workflow atm

My LLM codegen workflow atm. A detailed walkthrough of my current workflow for using LLms to build software, from brainstorming through ...

harper.blog

#llm

0 84 0

2025-02-24 06:52:06 UTC

New

General Dev>In The News

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Offline Reinforcement Learning for LLM Multi-Step Reasoning. Improving the multi-step reasoning ability of large language models (LLMs) ...

arxiv.org

#learning #llm

0 54 0

2024-12-23 17:08:43 UTC

New

General Dev>In The News

GitHub - NVIDIA/garak: the LLM vulnerability scanner

GitHub - NVIDIA/garak: the LLM vulnerability scanner. the LLM vulnerability scanner. Contribute to NVIDIA/garak development by creating ...

github.com

#nvidia #github #llm

0 70 0

2024-11-17 18:44:12 UTC

New

Backend>Blogs/Talks

Thinking Elixir 228 - From Surveys to Cheat Sheets

Episode 228 of Thinking Elixir. News includes Theo releasing his ElixirConf presentation video on his channel, the launch of the Gleam an...

youtube.com

#ai /elixir #conference #podcasts #llm

1 75 0

2024-11-12 15:08:26 UTC

New

Last Three Year's Trending

General Dev>In The News

Self-Retrieval: Building an information retrieval system with one LLM

Self-Retrieval: Building an Information Retrieval System with One Large Language Model. The rise of large language models (LLMs) has tra...

arxiv.org

#llm

1 297 0

2024-03-09 18:39:37 UTC

New

General Dev>In The News

PyTorch Library for Running LLM on Intel CPU and GPU

GitHub - intel-analytics/ipex-llm: Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma...

github.com

#library #intel #cpu #llm

0 285 0

2024-04-03 18:41:11 UTC

New

General Dev>In The News

LoRA from scratch: implementation for LLM finetuning

Code LoRA from Scratch - a Lightning Studio by sebastian. LoRA (Low-Rank Adaptation) is a popular technique to finetune LLMs more effici...

lightning.ai

#llm

0 331 0

2024-01-24 18:17:40 UTC

New

General Dev>In The News

Implementing a ChatGPT-like LLM from scratch, step by step

GitHub - rasbt/LLMs-from-scratch: Implementing a ChatGPT-like LLM from scratch, step by step. Implementing a ChatGPT-like LLM from scrat...

github.com

#chatgpt #llm

2 318 2

2024-01-30 03:51:30 UTC

New

General Dev>In The News

LLM Paper on Mamba MoE: Jamba Technical Report from AI2

Jamba: A Hybrid Transformer-Mamba Language Model. We present Jamba, a new base large language model based on a novel hybrid Transformer-...

arxiv.org

#paper #llm

0 268 0

2024-04-02 02:18:51 UTC

New

General Dev>In The News

ScrapeGraphAI: Web scraping using LLM and direct graph logic

Hello from Scrapegraph-ai | Scrapegraph-ai. Official documentation of Scrapegraph-ai

scrapegraph-doc.onrender.com

#web #graph #llm

0 243 0

2024-05-08 04:20:22 UTC

New

General Dev>In The News

Benchmarks and comparison of LLM AI models and API hosting providers

Home | ArtificialAnalysis.ai. Analysis of AI models and hosting providers - choose the best model and provider for your use case

artificialanalysis.ai

#api #hosting #llm

0 278 0

2024-01-17 04:53:52 UTC

New

Backend>Blogs/Talks

Thinking Elixir 185 - InstructorEx for LLMs

Episode 185 of Thinking Elixir. Dive into the world of structured LLM prompting with our latest guest who shares insights on their innova...

podcast.thinkingelixir.com

#ai /elixir #podcasts #json #chatgpt #llm

1 282 0

2024-01-16 13:34:15 UTC

New

General Dev>In The News

Show HN: Kindllm – LLM chat optimized for Kindle e-readers

Kindllm - LLM chat for Kindle. The distraction-free LLM chat app for Kindle

kindllm.app

#chat #llm

0 263 0

2024-01-18 18:28:52 UTC

New

General Dev>In The News

AMD's MI300X Outperforms Nvidia's H100 for LLM Inference

AMD’s MI300X Outperforms NVIDIA’s H100 for LLM Inference. Discover if AMD’s MI300X accelerator can outperform NVIDIA’s H100 in real-worl...

blog.tensorwave.com

#nvidia #amd #llm

0 222 0

2024-06-14 19:27:20 UTC

New

General Dev>In The News

Implementation of Google's Griffin Architecture – RNN LLM

GitHub - google-deepmind/recurrentgemma: Open weights language model from Google DeepMind, based on Griffin… Open weights language model...

github.com

#google #llm

0 220 0

2024-04-11 03:42:34 UTC

New

General Dev>In The News

Building an early warning system for LLM-aided biological threat creation

Building an early warning system for LLM-aided biological threat creation. We’re developing a blueprint for evaluating the risk that a l...

openai.com

#llm

0 244 0

2024-02-01 04:05:27 UTC

New

General Dev>In The News

Get consistent data from your LLM with JSON Schema

Get consistent data from your LLM with JSON Schema. How to parse content from a tool that is made to speak in human sentences.

thoughtbot.com

#json #llm

0 241 0

2024-02-17 19:53:42 UTC

New

Backend>Blogs/Talks

Easy at-home AI with Bumblebee and Fly GPUs

A big barrier to getting started with local AI development is access to hardware. And by “local”, I mean having direct access to a GPU an...

fly.io

#ai /elixir #blog-post #development #llm

1 225 0

2024-04-02 13:15:16 UTC

New

General Dev>In The News

LLM inference speed of light

LLM inference speed of light. In the process of working on calm, a minimal from-scratch fast CUDA implementation of transformer-based la...

zeux.io

#llm

0 213 0

2024-03-17 16:17:38 UTC

New

Trending Over Three Years

General Dev>In The News

Maxtext: A simple, performant and scalable Jax LLM

GitHub - google/maxtext: A simple, performant and scalable Jax LLM!. A simple, performant and scalable Jax LLM! Contribute to google/max...

github.com

#llm

0 195 0

2024-04-24 09:35:00 UTC

New

General Dev>In The News

What would an LLM OS look like?

What would an LLM OS look like?. Andrej Karpathy’s YouTube channel is fantasic. He just published an Intro to Large Language Models vide...

campedersen.com

#llm

0 229 0

2024-03-15 06:21:38 UTC

New

General Dev>In The News

Aya: An open LLM by 3k independent researchers across the globe

Aya. Cohere’s non-profit research lab, C4AI, released the Aya model, a state-of-the-art, open source, massively multilingual, research L...

cohere.com

#llm

0 221 0

2024-02-13 21:19:12 UTC

New

General Dev>In The News

Rule-based NLP system beats LLM for analysis of psychiatric clinical notes

Extracting Social Support and Social Isolation Information from Clinical Psychiatry Notes: Comparing a Rule-based NLP System and a Large ...

arxiv.org

#llm

0 215 0

2024-04-05 17:13:05 UTC

New

General Dev>In The News

DRINK ME: (Ab)Using a LLM to compress text

DRINK ME: (Ab)Using a LLM to compress text. Introduction Large language models are trained on huge datasets of text to learn the relat...

o565.com

#llm

0 202 0

2024-05-03 15:48:20 UTC

New

General Dev>In The News

MM1: Methods, Analysis and Insights from Multimodal LLM Pre-training

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training. In this work, we discuss building performant Multimodal Large La...

arxiv.org

#training #llm

0 209 0

2024-03-18 02:52:42 UTC

New

General Dev>In The News

Instruction-Based Image Editing via LLM

GitHub - apple/ml-mgie. Contribute to apple/ml-mgie development by creating an account on GitHub.

github.com

#llm

0 184 0

2024-02-07 05:05:51 UTC

New

General Dev>In The News

Can GPT optimize my taxes? An experiment in letting the LLM be the UX

Can GPT Optimize My Taxes?. TL;DR Yep.

finedataproducts.com

#llm

0 186 0

2024-04-02 02:19:46 UTC

New

General Dev>In The News

Qwen2 LLM Released

Hello Qwen2. GITHUB HUGGING FACE MODELSCOPE DEMO DISCORD Introduction After months of efforts, we are pleased to announce the evolution...

qwenlm.github.io

#llm

0 141 0

2024-06-07 00:14:14 UTC

New

General Dev>In The News

Your LLM Is a Capable Regressor When Given In-Context Examples

From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples. We analyze how well pre...

arxiv.org

#llm

0 171 0

2024-04-13 13:18:12 UTC

New

General Dev>In The News

How to think about creating a dataset for LLM fine-tuning evaluation

Alex Strick van Linschoten - How to think about creating a dataset for LLM finetuning evaluation. I summarise the kinds of evaluations t...

mlops.systems

#llm

0 158 0

2024-06-27 15:02:24 UTC

New

General Dev>In The News

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

arxiv.org

#llm

0 150 0

2024-04-23 06:00:08 UTC

New

General Dev>In The News

Citation Needed – Wikimedia Foundation's Experimental LLM/RAG Chrome Extension

Wikipedia Citation Needed. A chrome extension for finding citations in Wikipedia by using ChatGPT

chromewebstore.google.com

#chrome #llm

0 146 0

2024-05-12 14:37:05 UTC

New

General Dev>In The News

How to run an LLM on your PC, not in the cloud, in less than 10 minutes

How to run an LLM locally on your PC in less than 10 minutes. Cut through the hype, keep your data private, find out what all the fuss i...

theregister.com

#llm

0 125 0

2024-06-24 02:27:18 UTC

New

General Dev>In The News

Top Libraries to Accelerate LLM Building

Top 9 Libraries to Accelerate LLM Building. The Open-source Tool Stack to build, scale, test, deploy, and monitor LLMs in 2024.

blog.aiport.tech

#llm

0 115 0

2024-06-24 02:26:01 UTC

New

Follow
Join
Shape

the conversation

+Thread Join Now

Thinking Elixir Podcast 260: Cheaper testing with AI?

0 0

2025-07-08 12:22:34 UTC

Open WebUI: self-hosted AI platform designed to operate entirely offline

0 0

2025-07-07 05:23:54 UTC

LLMS & Elixir: Windfall or Deathblow?

0 1

As an Experienced LLM User, I Actually Don't Use Generative LLMs Often

1 1

Thinking Elixir 239 - Scaling to Unicorn Status

0 2

DeepSeek - the free, open source “ChatGPT killer”

6 3

Latest on Devtalk

Ralph Wiggum as a "software engineer"

General Dev>In The News

How to scale RL to 10^26 FLOPs

General Dev>In The News

V weekly.2025.29 released!

Backend>Official News

Clojure Deref (July 14, 2025)

Backend>Official News

Vapor 4.115.1 released!

Backend>Official News

Fable 4.26.0 released!

Frontend>Official News

Two guys hated using Comcast, so they built their own fiber ISP

General Dev>In The News

Reverse engineering and generation toolkit for Chrome's private x-browser-validation header, used for integrity

General Dev>In The News

Julia v1.12.0-rc1 released!

Backend>Official News

The upcoming GPT-3 moment for RL

AI>In The News

How does a screen work?

General Dev>In The News

A Technical Look at Iran’s Internet Shutdowns

General Dev>In The News

Programming affordance: when a language's patterns make it natural to make mistakes

General Dev>In The News

Switching to Claude Code + VSCode inside Docker

AI>In The News

Personal night light exposure predicts incidence of cardiovascular diseases in individuals

Science/Tech>Health & Diet

Happy 20th birthday Django!

Backend>Official News

Context Engineering Guide

AI>In The News

Vibe-Coding a PCB - surprisingly good

AI>In The News

LLM Inference in Production

AI>In The News

Apple Vs The Law

macOS>In The News

From The Mac to The Mystical: Bill Atkinson’s Psychedelic User Interface

macOS>In The News

Recovering from AI Addiction – Internet and Technology Addicts Anonymous

AI>In The News

Running Local LLMs with Ollama on openSUSE Tumbleweed

Linux>Official News

5 things I learned from 5 years at Vercel | Lee Robinson

General Dev>In The News

AI Agent Benchmarks are Broken

AI>In The News

We’re Light-Years Away from True Artificial Intelligence, Says Murderbot Author Martha Wells

AI>In The News

Hanami and the elephant in the room

Backend>Official News

Researchers create 3D interactive digital room from simple video | Cornell Chronicle

General Dev>In The News

Not So Fast: AI Coding Tools Can Actually Reduce Productivity

AI>In The News

Australia is quietly rolling out age checks for search engines like Google

General Dev>In The News

Devtalk ❯

Similar Portals

None added yet

Get money off!

The Pragmatic Bookshelf

35% off any eBook

Manning Publications

45% off any item

The Pragmatic Studio

20% off any course

Simply use coupon code "devtalk.com" at checkout. Where applicable this coupon can be used for an many items and as many times as you like!

We ❤️ helpful members!

We reward our most helpful members via our MOTM scheme - by giving away a whopping 25 books per year!

Filter by Type:

We're in Beta

About us Mission Statement See our Roadmap