Hot in DeepSeek In The News:
HOT IN DeepSeek In The News THIS YEAR!

This is cool!
DEEPSEEK-V3 ON M4 MAC: BLAZING FAST INFERENCE ON APPLE SILICON
We just witnessed something incredible: the largest open-s...

We’re a tiny team @deepseek-ai pushing our limits in AGI exploration.
Starting this week, Feb 24, 2025, we’ll open-source 5 repos – one ...

DeepSeek’s Multi-Head Latent Attention and Other KV Cache Tricks.
How a Key-Value (KV) cache reduces Transformer inference time by tradi...

This is probably one of the best technical (yet accessible) breakdowns around atm:

The Illustrated DeepSeek-R1.
A recipe for reasoning LLMs

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial.
Reproduce Deepseek R1 „aha moment“ and train an open model using reinforcemen...

A high-performance distributed file system designed to address the challenges of AI training and inference workloads. - GitHub - deepsee...

YouTube Transcript Optimizer.
Automatically generate polished and beautiful documents from your YouTube videos!

DeepSeek is pushing DuckDB beyond its single-node roots with smallpond, a new, simple approach to distributed compute. But does it solve ...

A big day for short sellers after Nvidia loses $600 billion off its market value.

Nvidia sheds almost $600 billion in market cap, biggest one-day loss in U.S. history.
Nvidia shares plunged 17% on Monday, resulting in ...

Run DeepSeek-R1 Dynamic 1.58-bit.
DeepSeek R-1 is the most powerful open-source reasoning model that performs on par with OpenAI’s o1 mo...

OpenAI says faster, more accurate STEM-focused model will be free to all users.

The AI Fad Just Burned to the Waterline.
Sometimes wide moats and billions of dollars to blow lead not to glory but to hubris, which bec...

We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token. To...
HOT IN DeepSeek In The News THE LAST THREE YEARS!

R1-Zero and R1 Results and Analysis.
An analysis of Deepseek’s R1

Nvidia’s $589 Billion DeepSeek Rout Is Largest in Market History.
(Bloomberg) – Nvidia Corp.’s plunge, fueled by investor concern about ...
DeepSeek
Classification: Large Language Model
Forum Category: AI
Threads: 25
Posts: 67
"Open source large language model made by DeepSeek."