最近这段时间购房、搬家、装修,疏于更新了。🙏
分享这篇看到过最好的关于 transformer 的综述之一:https://deeprevision.github.io/posts/001-transformer
#llm
分享这篇看到过最好的关于 transformer 的综述之一:https://deeprevision.github.io/posts/001-transformer
#llm
deeprevision.github.io
AI Research Blog - The Transformer Blueprint: A Holistic Guide to the Transformer Neural Network Architecture
A deep dive into Transformer, a neural network architecture that was introduced in the famous paper “attention is all you need” in 2017, its applications, impacts, challenges and future directions
TL;DR:基于 LLM 开发上层应用时大概率不需要 fine-tune 模型 — 通过各种技巧来提供领域特定的 context 是更为有效和低成本的做法。
https://www.tidepool.so/2023/08/17/why-you-probably-dont-need-to-fine-tune-an-llm/
#llm
https://www.tidepool.so/2023/08/17/why-you-probably-dont-need-to-fine-tune-an-llm/
#llm
www.tidepool.so
Why You (Probably) Don’t Need to Fine-tune an LLM
This post is targeted towards folks focused on building LLM (Large Language Model) applications (as opposed to research).