Parallel Experiments
Stay informed. Stay authentic.

Welcome to the public part of my brain. Here I share curations and thoughts.

Created with ❤️ by @linghao.
A little easter egg: I spent five minutes getting R1 to write a short sci-fi piece in the style of Peter Watts, the author of Blindsight.
Cogito ergo crash 🤣
How many of you don't understand Chinese and only follow this channel for English content?
Anonymous Poll
8%
Yes, I can only read English.
92%
No, I can read Chinese.
I recently spent some time preparing for ML interviews (with a focus on LLMs) and went through quite a few learning resources. Here are some worth sharing:

CMU 11-711 Advanced NLP

A survey of language modeling.

The Transformer Blueprint: A Holistic Guide to the Transformer Neural Network Architecture

A fairly good overview of the Transformer.

3Blue1Brown: Attention in transformers, step-by-step

The best video explaining attention, bar none.

Hugging Face: Mixture of Experts Explained

Hugging Face: RLHF

Hugging Face: Introduction to Deep Reinforcement Learning

Hugging Face: Multimodal Models

These Hugging Face resources are well suited to quickly filling gaps on their respective topics.

Lilian Weng: Agents

Still one of the best surveys of agents.

Understanding Reasoning LLMs

Covers some post-training details, with a focus on analyzing DeepSeek R1 and R1 Zero.

Designing Machine Learning Systems notes by @tms_ur_way

Good for quickly filling gaps on the key points of ML in practice.

Stable Diffusion Explained From Scratch

An explanation of the basic principles of diffusion.



Beyond these, the content from the following people is consistently good. Pick what to read or watch by topic.

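- Andrej Karpathy's YouTube videos
- Lilian Weng's blog
- Chip Huyen's blog

Most of what is recommended here is fairly introductory / high level and is meant mainly for filling gaps. To dig deep into a specific topic you still need to move on to further resources, papers, and so on. #ml #llm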
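Went to a ClickHouse meetup at the Netflix campus. As a showcase, their CTO built a slick aircraft trajectory visualization website from ADS-B data. There is a lot to dig into, including interesting patterns and implementation details. Worth a look.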

https://github.com/ClickHouse/adsb.exposed
https://a16z.com/a-deep-dive-into-mcp-and-the-future-of-ai-tooling/

Also, from Why MCP Won:
- MCP is "AI-Native" version of old idea
- MCP is an "open standard" with a big backer
- Anthropic has the best developer AI brand
- MCP based off LSP, an existing successful protocol
- MCP dogfooded with complete set of 1st party client, servers, tooling, SDKs
- MCP started with minimal base, but with frequent roadmap updates
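Gemini 2.5 was released yesterday. This post is not about the model itself. Instead, it shares an interesting math puzzle mentioned in the related HN discussion [1]. The poster claims Gemini 2.5 is the first model to answer it correctly on the first try. The puzzle: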

There's three people in a circle. Each person has a positive integer floating above their heads, such that each person can see the other two numbers but not his own. The sum of two of the numbers is equal to the third. The first person is asked for his number, and he says that he doesn't know. The second person is asked for his number, and he says that he doesn't know. The third person is asked for his number, and he says that he doesn't know. Then, the first person is asked for his number again, and he says: 65. What is the product of the three numbers?


The answer is here: [2]

[1] https://news.ycombinator.com/item?id=43473489
[2] https://www.reddit.com/r/math/comments/32m611/logic_question_that_has_me_stumped/
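
For anyone who would rather check the puzzle above by brute force than by hand, here is a minimal possible-worlds sketch. It assumes all three people are perfect logicians and that every "I don't know" is heard by everyone. The names (candidates, survives, knows, solve) and the search bound limit are my own illustrative choices, not taken from either linked thread.

```python
from functools import lru_cache

World = tuple[int, int, int]  # numbers of the (first, second, third) person


def candidates(world: World, person: int) -> list[World]:
    """Worlds `person` cannot tell apart: their own number is either the sum
    or the positive difference of the two numbers they can see."""
    x, y = (world[j] for j in range(3) if j != person)
    own_options = {x + y}
    if x != y:  # a difference of 0 would not be a positive integer
        own_options.add(abs(x - y))
    return [world[:person] + (v,) + world[person + 1:] for v in sorted(own_options)]


@lru_cache(maxsize=None)
def survives(world: World, rounds: int) -> bool:
    """True if the first `rounds` answers would all be "I don't know" in `world`."""
    if rounds == 0:
        return True
    speaker = (rounds - 1) % 3  # people answer in order 0, 1, 2, 0, ...
    return survives(world, rounds - 1) and not knows(world, speaker, rounds - 1)


@lru_cache(maxsize=None)
def knows(world: World, person: int, rounds: int) -> bool:
    """True if, after `rounds` "don't know" answers, `person` has exactly one
    candidate world left."""
    live = [w for w in candidates(world, person) if survives(w, rounds)]
    return len(live) == 1


def solve(first_number: int = 65, limit: int = 200) -> list[World]:
    """Triples where three "don't know" answers are followed by the first
    person correctly naming `first_number`. `limit` bounds the search over
    the other two numbers and is an assumption of this sketch."""
    solutions = []
    for b in range(1, limit + 1):
        for c in range(1, limit + 1):
            for a in {b + c, abs(b - c)}:
                world = (a, b, c)
                if a == first_number and survives(world, 3) and knows(world, 0, 3):
                    solutions.append(world)
    return solutions
```

Calling solve() returns every triple consistent with the dialogue. Multiply the three entries to get the requested product, then compare with [2].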
Forwarded from C’s Random Collection
https://ai-2027.com “We predict that the impact of superhuman AI over the next decade will be enormous, exceeding that of the Industrial Revolution.” Either way, the interaction design on this page is great. #ai
Found a really handy Obsidian plugin: https://github.com/RyotaUshio/obsidian-pdf-plus

It uses backlinks to let you annotate and take notes on PDFs without leaving Obsidian, and the notes can be spread across multiple files. The design feels very Obsidian native.

#obsidian
A really good and concise deep dive into RLHF in LLM post-training, Proximal Policy Optimization (PPO), and Group Relative Policy Optimization (GRPO)
https://yugeten.github.io/posts/2025/01/ppogrpo/
#llm