Latest #llm Threads Top

CommunityNews
If you’re underwhelmed with AI coding agents or simply want to get more out of them, give parallelization a try. After seeing the results...
New
CommunityNews
One common practice for working with MCP tools calls is to put the outputs from a tool back into the LLM as a message, and ask the LLM fo...
New
CommunityNews
CocoIndex now supports knowledge graph with incremental processing. Build live knowledge for agents is super easy with CocoIndex!
New
New
New
CommunityNews
In this technical report, we tackle the challenges of training large-scale Mixture of Experts (MoE) models, focusing on overcoming cost i...
New
CommunityNews
Open Euro LLM. A series of foundation models for transparent AI in Europe
New
CommunityNews
My LLM codegen workflow atm. A detailed walkthrough of my current workflow for using LLms to build software, from brainstorming through ...
New
brainlid
Episode 239 of Thinking Elixir. News includes an impressive case study from Remote showing how they scaled Elixir to support nearly 300 e...
New
CommunityNews
OpenAI o3-mini, now available in LLM. OpenAI’s o3-mini is out today. As with other o-series models it’s a slightly difficult one to eval...
New

This Week's Trending Top

CommunityNews
If you’re underwhelmed with AI coding agents or simply want to get more out of them, give parallelization a try. After seeing the results...
New

This Month's Trending Top

New
CommunityNews
CocoIndex now supports knowledge graph with incremental processing. Build live knowledge for agents is super easy with CocoIndex!
New
CommunityNews
One common practice for working with MCP tools calls is to put the outputs from a tool back into the LLM as a message, and ask the LLM fo...
New

This Year's Trending Top

AstonJ
Loads of news stories about DeepSeek here in the last few days, no surprise as it’s been making headlines across the world! Currently a h...
New
CommunityNews
AMD’s MI300X Outperforms NVIDIA’s H100 for LLM Inference. Discover if AMD’s MI300X accelerator can outperform NVIDIA’s H100 in real-worl...
New
CommunityNews
Hello Qwen2. GITHUB HUGGING FACE MODELSCOPE DEMO DISCORD Introduction After months of efforts, we are pleased to announce the evolution...
New
CommunityNews
Alex Strick van Linschoten - How to think about creating a dataset for LLM finetuning evaluation. I summarise the kinds of evaluations t...
New
CommunityNews
Use Prolog to improve LLM’s reasoning. On one side, LLMs show unseen capabilities in reasoning, but on the other - reasoning in LLMs is ...
New
CommunityNews
In this technical report, we tackle the challenges of training large-scale Mixture of Experts (MoE) models, focusing on overcoming cost i...
New
CommunityNews
How to run an LLM locally on your PC in less than 10 minutes. Cut through the hype, keep your data private, find out what all the fuss i...
New
New
CommunityNews
Top 9 Libraries to Accelerate LLM Building. The Open-source Tool Stack to build, scale, test, deploy, and monitor LLMs in 2024.
New
CommunityNews
Offline Reinforcement Learning for LLM Multi-Step Reasoning. Improving the multi-step reasoning ability of large language models (LLMs) ...
New
CommunityNews
My LLM codegen workflow atm. A detailed walkthrough of my current workflow for using LLms to build software, from brainstorming through ...
New
CommunityNews
GitHub - NVIDIA/garak: the LLM vulnerability scanner. the LLM vulnerability scanner. Contribute to NVIDIA/garak development by creating ...
New
brainlid
Episode 239 of Thinking Elixir. News includes an impressive case study from Remote showing how they scaled Elixir to support nearly 300 e...
New
CommunityNews
GitHub - samuel-vitorino/lm.rs: Minimal LLM inference in Rust. Minimal LLM inference in Rust. Contribute to samuel-vitorino/lm.rs develo...
New
CommunityNews
Forest Friends Zine. A guide for AI Engineers building the wild world of LLM system evals
New

Last Three Year's Trending Top

CommunityNews
Self-Retrieval: Building an Information Retrieval System with One Large Language Model. The rise of large language models (LLMs) has tra...
New
CommunityNews
Code LoRA from Scratch - a Lightning Studio by sebastian. LoRA (Low-Rank Adaptation) is a popular technique to finetune LLMs more effici...
New
CommunityNews
GitHub - intel-analytics/ipex-llm: Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma...
New
First poster: dani
GitHub - rasbt/LLMs-from-scratch: Implementing a ChatGPT-like LLM from scratch, step by step. Implementing a ChatGPT-like LLM from scrat...
New
CommunityNews
Jamba: A Hybrid Transformer-Mamba Language Model. We present Jamba, a new base large language model based on a novel hybrid Transformer-...
New
CommunityNews
Hello from Scrapegraph-ai | Scrapegraph-ai. Official documentation of Scrapegraph-ai
New
brainlid
Episode 185 of Thinking Elixir. Dive into the world of structured LLM prompting with our latest guest who shares insights on their innova...
New
CommunityNews
Home | ArtificialAnalysis.ai. Analysis of AI models and hosting providers - choose the best model and provider for your use case
New
CommunityNews
GitHub - google-deepmind/recurrentgemma: Open weights language model from Google DeepMind, based on Griffin… Open weights language model...
New
CommunityNews
Kindllm - LLM chat for Kindle. The distraction-free LLM chat app for Kindle
New
CommunityNews
Building an early warning system for LLM-aided biological threat creation. We’re developing a blueprint for evaluating the risk that a l...
New
CommunityNews
Get consistent data from your LLM with JSON Schema. How to parse content from a tool that is made to speak in human sentences.
New
brainlid
A big barrier to getting started with local AI development is access to hardware. And by “local”, I mean having direct access to a GPU an...
New
CommunityNews
LLM inference speed of light. In the process of working on calm, a minimal from-scratch fast CUDA implementation of transformer-based la...
New
CommunityNews
What would an LLM OS look like?. Andrej Karpathy’s YouTube channel is fantasic. He just published an Intro to Large Language Models vide...
New

Trending Over Three Years Top

CommunityNews
Aya. Cohere’s non-profit research lab, C4AI, released the Aya model, a state-of-the-art, open source, massively multilingual, research L...
New
CommunityNews
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training. In this work, we discuss building performant Multimodal Large La...
New
CommunityNews
DRINK ME: (Ab)Using a LLM to compress text. Introduction Large language models are trained on huge datasets of text to learn the relat...
New
CommunityNews
Extracting Social Support and Social Isolation Information from Clinical Psychiatry Notes: Comparing a Rule-based NLP System and a Large ...
New
CommunityNews
GitHub - apple/ml-mgie. Contribute to apple/ml-mgie development by creating an account on GitHub.
New
CommunityNews
GitHub - google/maxtext: A simple, performant and scalable Jax LLM!. A simple, performant and scalable Jax LLM! Contribute to google/max...
New
CommunityNews
From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples. We analyze how well pre...
New
New
CommunityNews
Wikipedia Citation Needed. A chrome extension for finding citations in Wikipedia by using ChatGPT
New
New
NewsBot
A new Go blog post/announcement has been posted! Get the full details here: Building LLM-powered applications in Go - The Go Programmin...
New
CommunityNews
Rethinking LLM Inference: Why Developer AI Needs a Different Approach. A technical blog post from Augment Code explaining their approach...
New
brainlid
Episode 228 of Thinking Elixir. News includes Theo releasing his ElixirConf presentation video on his channel, the launch of the Gleam an...
New
CommunityNews
OpenAI o3-mini, now available in LLM. OpenAI’s o3-mini is out today. As with other o-series models it’s a slightly difficult one to eval...
New
CommunityNews
Does your LLM truly unlearn? An embarrassingly simple approach to recover unlearned knowledge. Large language models (LLMs) have shown r...
New
  • Follow
  • Join
  • Shape
the conversation

Latest on Devtalk

Devtalk

Similar Portals

    None added yet

Get money off!

The Pragmatic Bookshelf

35% off any eBook

Manning Publications

45% off any item

The Pragmatic Studio

20% off any course

Simply use coupon code "devtalk.com" at checkout. Where applicable this coupon can be used for an many items and as many times as you like!