LogoPromocode
LatestAgentOpenAILLMAbout

DEEPSEEK

GRPO (Group Relative Policy Optimization) Study Notes
GRPOMarch 4, 2025

GRPO (Group Relative Policy Optimization) Study Notes

We introduce Group Relative Policy Optimization (GRPO), a variant of Proximal Policy Optimization (PPO)

DeepSeek #OpenSourceWeek - Five Consecutive Releases
DEEPSEEKFebruary 28, 2025

DeepSeek #OpenSourceWeek - Five Consecutive Releases

We're a tiny team @deepseek_ai exploring AGI.

DeepSeek-R1 - Andrej Karpathy in-depth explanation of LLM (Part 9)
ANDREJFebruary 24, 2025

DeepSeek-R1 - Andrej Karpathy in-depth explanation of LLM (Part 9)

DeepSeek R1

How Different Large Models Like DeepSeek R1/ChatGPT o3/Grok3 Think
LLMFebruary 19, 2025

How Different Large Models Like DeepSeek R1/ChatGPT o3/Grok3 Think

LLM Think

Andrej Karpathy in-depth explanation of large language model (LLM) technology (Part 1) - [Pretraining and Inference]
DEEPFebruary 11, 2025

Andrej Karpathy in-depth explanation of large language model (LLM) technology (Part 1) - [Pretraining and Inference]

- introduction - pretraining data (internet) - tokenization - neural network I/O - neural network internals - inference

DeepSeek Janus Series: Unified Multimodal Understanding and Generation Models
DEEPSEEKJanuary 28, 2025

DeepSeek Janus Series: Unified Multimodal Understanding and Generation Models

Janus-Series: Unified Multimodal Understanding and Generation Models

Comparison of the reasoning processes between ChatGPT o1 pro and DeepSeek R1
DEEPSEEKJanuary 27, 2025

Comparison of the reasoning processes between ChatGPT o1 pro and DeepSeek R1

DeepSeek R1 Vs ChatGPT 01 (My Experience)

DeepSeek R1: X.com User Reviews
DEEPSEEKJanuary 26, 2025

DeepSeek R1: X.com User Reviews

Deepseek-r1 is open source and on par with o1 preview - @bindureddy

Paper of DeepSeek-R1: Exploration and Breakthrough of the New Generation Inference Model
LLMJanuary 25, 2025

Paper of DeepSeek-R1: Exploration and Breakthrough of the New Generation Inference Model

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

POPULAR

AI

Messari Report Translation and Summary 【1-7 Surviving the Winter】

AI

IMAGDressing

AI

Solidity From Beginner to Give Up (1) - Variable Types

AI

Runway's Gen-3 Alpha: The Latest AI-Powered Video Generation Model

DEEPSEEK

DeepSeek Janus Series: Unified Multimodal Understanding and Generation Models

LLM

See More

AlphaGo and the Power of Reinforcement Learning - Andrej Karpathy's Deep Dive on LLMs (Part 9)

DEEPMarch 21, 2025

Reinforcement Learning from Human Feedback (RLHF) - Andrej Karpathy's Deep Dive on LLMs (Part 10)

DEEPMarch 22, 2025

The Future of Large Language Models - Andrej Karpathy's In-Depth Explanation of LLM (Part 11)

DEEPMarch 23, 2025

DEEPSEEK

Learn about DeepSeek's innovative approaches to AI research and their contributions to the field.

GRPO (Group Relative Policy Optimization) Study Notes
GRPOMarch 4, 2025

GRPO (Group Relative Policy Optimization) Study Notes

We introduce Group Relative Policy Optimization (GRPO), a variant of Proximal Policy Optimization (PPO)

DeepSeek #OpenSourceWeek - Five Consecutive Releases
DEEPSEEKFebruary 28, 2025

DeepSeek #OpenSourceWeek - Five Consecutive Releases

We're a tiny team @deepseek_ai exploring AGI.

DeepSeek-R1 - Andrej Karpathy in-depth explanation of LLM (Part 9)
ANDREJFebruary 24, 2025

DeepSeek-R1 - Andrej Karpathy in-depth explanation of LLM (Part 9)

DeepSeek R1

How Different Large Models Like DeepSeek R1/ChatGPT o3/Grok3 Think
LLMFebruary 19, 2025

How Different Large Models Like DeepSeek R1/ChatGPT o3/Grok3 Think

LLM Think

Andrej Karpathy in-depth explanation of large language model (LLM) technology (Part 1) - [Pretraining and Inference]
DEEPFebruary 11, 2025

Andrej Karpathy in-depth explanation of large language model (LLM) technology (Part 1) - [Pretraining and Inference]

- introduction - pretraining data (internet) - tokenization - neural network I/O - neural network internals - inference

DeepSeek Janus Series: Unified Multimodal Understanding and Generation Models
DEEPSEEKJanuary 28, 2025

DeepSeek Janus Series: Unified Multimodal Understanding and Generation Models

Janus-Series: Unified Multimodal Understanding and Generation Models

Comparison of the reasoning processes between ChatGPT o1 pro and DeepSeek R1
DEEPSEEKJanuary 27, 2025

Comparison of the reasoning processes between ChatGPT o1 pro and DeepSeek R1

DeepSeek R1 Vs ChatGPT 01 (My Experience)

DeepSeek R1: X.com User Reviews
DEEPSEEKJanuary 26, 2025

DeepSeek R1: X.com User Reviews

Deepseek-r1 is open source and on par with o1 preview - @bindureddy

Paper of DeepSeek-R1: Exploration and Breakthrough of the New Generation Inference Model
LLMJanuary 25, 2025

Paper of DeepSeek-R1: Exploration and Breakthrough of the New Generation Inference Model

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

POPULAR

AI

Messari Report Translation and Summary 【1-7 Surviving the Winter】

AI

IMAGDressing

AI

Solidity From Beginner to Give Up (1) - Variable Types

AI

Runway's Gen-3 Alpha: The Latest AI-Powered Video Generation Model

DEEPSEEK

DeepSeek Janus Series: Unified Multimodal Understanding and Generation Models

AI TOOLS

ChatGPTGeminiDeepSeekGrokElevenLabsClaude

LLM

See More

AlphaGo and the Power of Reinforcement Learning - Andrej Karpathy's Deep Dive on LLMs (Part 9)

DEEPMarch 21, 2025

Reinforcement Learning from Human Feedback (RLHF) - Andrej Karpathy's Deep Dive on LLMs (Part 10)

DEEPMarch 22, 2025

The Future of Large Language Models - Andrej Karpathy's In-Depth Explanation of LLM (Part 11)

DEEPMarch 23, 2025

GOOGLE

See More

Trial of Google's video generation model VOE2

GOOGLEMarch 23, 2025

Gemini 2.5 Pro, claimed to be far ahead of the competition, has been released with great fanfare: comprehensively surpassing other LLMs and topping the global rankings

LLMMarch 26, 2025

AI-Researcher: LLM-driven全自动 scientific research assistant

AIMarch 30, 2025

SUBSCRIBE

All our premium content and latest news delivered straight to your inbox

PROMOCODE

LATESTAGENTOPENAILLMGOOGLENVIDIADEEPSEEKOCRCHATGPTGENERATORCLAUDEABOUT

© 2024 Promocode. ALL RIGHTS RESERVED

PROMOCODE

© 2024 Promocode. ALL RIGHTS RESERVED.

LATESTAGENTOPENAILLM
GOOGLENVIDIADEEPSEEKOCR
CHATGPTGENERATORCLAUDEABOUT