Parallel Experiments

https://huggingface.co/spaces/nanotron/ultrascale-playbook
Hugging Face has released a playbook on scaling LLM training on GPUs. It should be more broadly applicable than DeepMind's TPU-focused scaling book. #llm

huggingface.co

The Ultra-Scale Playbook - a Hugging Face Space by nanotron

The ultimate guide to training LLM on large GPU Clusters

🔥3

1.32K views · Linghao Zhang, 20:32

Parallel Experiments

💃 Saw this live at the Las Vegas Sphere last week. Absolutely amazing.
https://www.youtube.com/watch?v=DKvWHjQAGqo

YouTube

Anyma - Hypnotized (feat. Ellie Goulding) [Live from Sphere Las Vegas]

Ellie Goulding and Anyma perform “Hypnotized” live from Sphere Las Vegas.

Listen to “Hypnotized (feat. Ellie Goulding)” now: https://anyma-ellie.lnk.to/hypnotized

Follow Ellie:
Instagram: https://www.instagram.com/elliegoulding
TikTok: https://www.ti…

🔥1

924 views · Linghao Zhang, 03:53

Parallel Experiments

While preparing for ML interviews (with a focus on LLMs) a while back, I went through quite a few learning resources. Here are some of them:

CMU 11-711 Advanced NLP

A survey of language modeling.

The Transformer Blueprint: A Holistic Guide to the Transformer Neural Network Architecture

A fairly good survey of the Transformer.

3Blue1Brown: Attention in transformers, step-by-step

The best video explaining attention, bar none.

Hugging Face: Mixture of Experts Explained

Hugging Face: RLHF

Hugging Face: Introduction to Deep Reinforcement Learning

Hugging Face: Multimodal Models

These Hugging Face resources are well suited to quickly filling gaps on the related topics.

Lilian Weng: Agents

Still one of the best surveys of agents.

Understanding Reasoning LLMs

Covers some post-training details, with a focus on analyzing DeepSeek R1 and R1 Zero.

Designing Machine Learning Systems notes by @tms_ur_way

Good for quickly filling gaps on the key points of ML practice.

Stable Diffusion Explained From Scratch

An explanation of the basic principles of diffusion.

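Beyond these, the content from the following people is all very good and can be consumed selectively by topic.

- Andrej Karpathy's YouTube videos
- Lilian Weng's blog
- Chip Huyen's blog

Most of what is recommended here is fairly introductory / high level, meant mainly for filling gaps. To dig deep into a specific topic, you still need to go on to further resources, papers, and so on. #ml #llm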

👍29❤8

2.97K views · Linghao Zhang, edited 19:22

Parallel Experiments

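Went to the Netflix campus for a ClickHouse meetup. As a showcase, their CTO built a slick website that visualizes aircraft trajectories from ADS-B data. There is a lot to dig into, including interesting patterns in the data and implementation details. Worth a look.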

https://github.com/ClickHouse/adsb.exposed

GitHub

GitHub - ClickHouse/adsb.exposed: Interactive visualization and analytics on ADS-B data with ClickHouse

Interactive visualization and analytics on ADS-B data with ClickHouse - ClickHouse/adsb.exposed

👍1

1.12K views · Linghao Zhang, edited 07:22

Parallel Experiments

Andreessen Horowitz

A Deep Dive Into MCP and the Future of AI Tooling | Andreessen Horowitz

We explore what MCP is, how it changes the way AI interacts with tools, what developers are already building, and the challenges that still need solving.

https://a16z.com/a-deep-dive-into-mcp-and-the-future-of-ai-tooling/

Also, from Why MCP Won:

- MCP is "AI-Native" version of old idea
- MCP is an "open standard" with a big backer
- Anthropic has the best developer AI brand
- MCP based off LSP, an existing successful protocol
- MCP dogfooded with complete set of 1st party client, servers, tooling, SDKs
- MCP started with minimal base, but with frequent roadmap updates

1.23K views · Linghao Zhang, edited 00:46

Parallel Experiments

A pretty entertaining classic murder mystery set in the White House.
https://www.imdb.com/title/tt8740614/

IMDb

The Residence (TV Mini Series 2025) ⭐ 7.7 | Comedy, Crime, Drama

50m | TV-MA

784 views · Linghao Zhang, 22:22

Parallel Experiments

https://store.steampowered.com/app/2394650/Crypt_Custodian/
🎮 Yet another metroidvania. The controls feel great and the game is very cute. #game

Steampowered

Save 35% on Crypt Custodian on Steam

Crypt Custodian is a charming metroidvania about cleaning up the afterlife. Play as Pluto - a mischievous cat who has died, and is sentenced to be the afterworld's janitor... FOREVER! Hang out with other doomed ghosts, battle beasts, and explore a vastly…

765 views · Linghao Zhang, 18:00

Parallel Experiments

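Gemini 2.5 was released yesterday. This post is not about the model itself. Instead, it shares an interesting math puzzle mentioned in the related HN discussion [1]. The poster claims that Gemini 2.5 is the first model to answer it correctly in one shot. The puzzle: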

There's three people in a circle. Each person has a positive integer floating above their heads, such that each person can see the other two numbers but not his own. The sum of two of the numbers is equal to the third. The first person is asked for his number, and he says that he doesn't know. The second person is asked for his number, and he says that he doesn't know. The third person is asked for his number, and he says that he doesn't know. Then, the first person is asked for his number again, and he says: 65. What is the product of the three numbers?

The answer is here: [2]

[1] https://news.ycombinator.com/item?id=43473489
[2] https://www.reddit.com/r/math/comments/32m611/logic_question_that_has_me_stumped/
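
For anyone who would rather check the puzzle by brute force than by hand, here is a minimal sketch. It assumes all three people are perfect logicians and searches only a bounded range of numbers. The helper names (`candidates`, `knows`, `solve`) and the `limit` parameter are made up for illustration and do not come from either linked thread. Running it reveals the answer.

```python
from functools import lru_cache
from math import prod


def candidates(world, i):
    """Worlds person i considers possible: their own number is either
    the sum or the (positive) difference of the two numbers they see."""
    x, y = (world[j] for j in range(3) if j != i)
    values = {x + y}
    if x != y:  # a difference of 0 is not a positive integer
        values.add(abs(x - y))
    return [world[:i] + (v,) + world[i + 1:] for v in sorted(values)]


@lru_cache(maxsize=None)
def knows(world, step):
    """True if the speaker at `step` (speaker index = step % 3) can name
    their number in `world`, given that every earlier speaker said
    "I don't know". Assumes `world` itself fits that history."""
    speaker = step % 3
    consistent = [
        w for w in candidates(world, speaker)
        # Keep w only if nobody would have known earlier in w.
        if not any(knows(w, earlier) for earlier in range(step))
    ]
    return len(consistent) == 1


def solve(first_number, limit=200):
    """Find worlds (a, b, c) with a == first_number where the first three
    answers are "I don't know" and the fourth answer is a known number.
    `limit` is an assumed search bound on the other two numbers."""
    answers = []
    for b in range(1, limit + 1):
        for c in range(1, limit + 1):
            world = (first_number, b, c)
            if 2 * max(world) != sum(world):  # largest must equal the sum of the other two
                continue
            if any(knows(world, s) for s in range(3)):
                continue
            if knows(world, 3):
                answers.append(world)
    return answers


for world in solve(65):
    print(world, "product:", prod(world))
```

The recursion is shallow (at most four statements deep), so memoizing `knows` keeps the search fast even though each hypothetical world spawns further hypothetical worlds.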

From the math community on Reddit: Logic question that has me stumped.

Explore this post and more from the math community

🤔3❤1

904 views · Linghao Zhang, edited 19:05

Parallel Experiments

An easy-to-follow intro to zero-knowledge proofs: https://youtu.be/Otvcbw6k4eo

YouTube

I can prove I’ve solved this Sudoku without revealing it

Support us on Patreon: http://patreon.com/polylog
I can convince you that I’ve solved a sudoku without giving you any information about my solution. We discuss how to do this using what cryptographers call a zero-knowledge proof, and how the same tricks…

967 views · Linghao Zhang, 23:17

Parallel Experiments

A four-episode miniseries in which every episode is filmed in a single continuous shot. Worth watching again and again!
https://www.imdb.com/title/tt31806037/

IMDb

Adolescence (TV Mini Series 2025) ⭐ 8.1 | Crime, Drama, Thriller

1h | TV-MA

1.03K views · Linghao Zhang, 22:34

Parallel Experiments

Forwarded from C’s Random Collection

https://ai-2027.com “We predict that the impact of superhuman AI over the next decade will be enormous, exceeding that of the Industrial Revolution.” Whatever you make of the prediction, the interaction design of this page is great. #ai

Ai-2027

AI 2027

A research-backed AI scenario forecast.

🤩1

845 views · Linghao Zhang, 06:41

Parallel Experiments

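Found a very useful Obsidian plugin: https://github.com/RyotaUshio/obsidian-pdf-plus

It uses backlinks to let you annotate and take notes on PDFs without leaving Obsidian, and the notes can be spread across multiple files. The design feels quite Obsidian-native.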

#obsidian

❤2

1.16K views · Linghao Zhang, 21:21

Parallel Experiments

A really good and concise deep dive into RLHF in LLM post-training, Proximal Policy Optimization (PPO), and Group Relative Policy Optimization (GRPO):
https://yugeten.github.io/posts/2025/01/ppogrpo/
#llm

819 views · Linghao Zhang, edited 02:24

Parallel Experiments

https://www.anthropic.com/research/tracing-thoughts-language-model
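This LLM interpretability research from Anthropic reaches quite a few interesting conclusions. For a TL;DR, read this blog post. If you are interested, take a look at the two corresponding papers, which have more detail and nicely done page interactions. #llm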

https://transformer-circuits.pub/2025/attribution-graphs/biology.html
https://transformer-circuits.pub/2025/attribution-graphs/methods.html

Anthropic

Tracing the thoughts of a large language model

Anthropic's latest interpretability research: a new microscope to understand Claude's internal mechanisms

879 views · Linghao Zhang, 21:37
