Learning

学习资料

如果你刚进入实验室，可以先从分类学习资料开始；如果已经确定研究方向，可以直接进入 A、B、C 三个方向的阅读入口。

当前数据源：google-sheet

Reading Guide

How to Read a Paper

Author: S. Keshav

如果你刚开始进入科研阅读，推荐先读这篇经典短文。它提出了常用的“三遍读论文法”，可以帮助你先快速判断一篇论文值不值得深入读，再逐步理解方法、实验和可复现细节。

打开论文

Google Sheet Sync

通过 Google Sheet 修改并同步

如需维护学习资料，可通过共享 Google Sheet 同步整理方向入口和分类条目。当前页面优先读取 Google Sheet；如共享表暂时不可用，会自动回退到仓库内备份数据。

说明：修改后不会立即同步，当前通过 GitHub Actions 按 15 分钟定时构建与同步更新。

打开 Google Sheet

direction_entries：三大研究方向的论文与代码仓库
learning_resources：分类学习资料条目

Direction Entry

方向学习入口

建议按“先看综述，再看主线，再看专题，最后做代码复现”的顺序学习。你可以先用综述建立整体认识，再通过主线论文把握核心问题与系统方法，接着根据兴趣进入专题阅读，最后结合共享 Google Sheet 中当前维护的代码仓库条目做实现与复现。

方向 A

多智能体协同感知

协同感知多模态UAV视觉语言导航

代表性论文 / 综述

Collaborative Perception in Autonomous Driving: Methods, Datasets, and Challenges [链接] · 入门时建议先看一篇基础综述，再看几篇经典论文（如序号2-7），之后再看新论文
Where2comm: Communication-Efficient Collaborative Perception via Spatial Confidence Maps [链接] · 经典论文（通信高效）
OPV2V: An Open Benchmark Dataset and Fusion Pipeline for Perception with Vehicle-to-Vehicle Communication [链接] · 经典论文（数据集）
An Extensible Framework for Open Heterogeneous Collaborative Perception [链接] · 经典论文（异构协同）
Communication-Efficient Collaborative Perception via Information Filling with Codebook [链接] · 经典论文（通信高效）
Robust Real-time Multi-vehicle Collaboration on Asynchronous Sensors [链接] · 经典论文（MobiCom）
Leveraging Large Vision Model for Multi-UAV Co-perception in Low-Altitude Wireless Networks [链接] · 前沿引导
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models [链接] · 综述
TRAVEL: Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology [链接] · 系统主线
STAMP: Scalable Task And Model-agnostic Collaborative Perception [链接] · 系统主线
CoSDH: Communication-Efficient Collaborative Perception via Supply-Demand Awareness and Intermediate-Late Hybridization [链接] · 系统主线
ViSA-Enhanced Aerial VLN: A Visual-Spatial Reasoning Enhanced Framework for Aerial Vision-Language Navigation [链接] · 专题
mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework [链接] · 专题
SiMO: Single-Modality-Operable Multimodal Collaborative Perception [链接] · 专题

代码仓库

OPENCOOD · 推荐，协同感知最流行的开源代码库
CoAlign · 推荐，相较于OPENCOOD支持更多的数据集
HEAL · 异构设置下的协同感知代码库
OpenUAV · 适合上手 UAV-VLN 场景下的平台搭建、任务理解与基准复现
STAMP Official Implementation · 适合上手协同感知主线方法与系统复现
CoSDH · 适合实践低通信开销条件下的协同感知复现

方向 B

边缘大模型推理与部署

边缘部署推理引擎量化加速大小模型协同

代表性论文 / 综述

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency [链接] · 综述
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey [链接] · 综述
Jupiter: Fast and Resource-Efficient Collaborative Inference of Generative LLMs on Edge Devices [链接] · 系统主线
EdgeShard: Efficient LLM Inference via Self-Adaptive Model Sharding [链接] · 系统主线
Large-Small Model Collaboration in Mobile Edge Networks With Heterogeneous Computational Resources [链接] · 系统主线
SpecPipe: Accelerating Pipeline Parallelism-based LLM Inference with Speculative Decoding [链接] · 专题
Dynamic Model Routing and Cascading for Efficient LLM Inference: A Survey [链接] · 专题
Compact LLM Deployment and World Model Assisted Offloading in Mobile Edge Computing [链接] · 专题

代码仓库

llama.cpp · 适合新手入门本地量化部署与边缘端推理
MLC-LLM · 适合上手移动端与边缘设备上的大模型部署
SGLang · 适合实践推理引擎、投机解码与服务化部署

方向 C

边缘多智能体系统

多智能体Agent记忆机制强化学习

代表性论文 / 综述

Large language model based multi-agents: a survey of progress and challenges [链接] · 综述
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey [链接] · 综述
G-Memory: Tracing Hierarchical Memory for Multi-Agent Systems [链接] · 系统主线
AgentRL: Scaling Multi-turn and Reasoning Capabilities of Language Agents via Reinforcement Learning [链接] · 系统主线
MEMO: Memory-Augmented Model Context Optimization for Robust Multi-Turn Multi-Agent LLM Games [链接] · 系统主线
Teams of LLM Agents can Exploit Zero-Day Vulnerabilities [链接] · 专题
Multimodal Large Language Models-Enabled UAV Swarm: Towards Efficient and Intelligent Autonomous Aerial Systems [链接] · 专题
FlowPIE: Test-Time Scientific Idea Evolution with Flow-Guided Literature Exploration [链接] · 专题

代码仓库

LangChain · 适合先熟悉 Agent、工具调用与工作流基础能力
LangGraph · 适合进一步实践有状态多 Agent 编排与流程控制
GMemory · 适合上手多智能体记忆机制与层次化状态管理的实现
AgentRL · 适合实践语言智能体的多轮决策与强化学习训练

Learning Package

分类学习资料

如果你还没有确定具体研究方向，建议先从经验、课程和书籍开始，再根据需要进入科研工具、编程开发和开源工具部分。

经验

熊辉：为什么人前进的路总是被自己挡住 · 经验文章
The Life-Changing Skill of Emotional Regulation - Dr Marc Brackett · 人生经验 · 情绪调节

课程

MIT Introduction to Algorithms · 基础算法 · 算法入门
Prof. Hung-yi Lee Machine Learning · 机器学习 · 经典课程
Andrew Ng Deep Learning Specialization · 深度学习 · 体系化入门
【李沐】动手学深度学习 · 深度学习 · 中文入门课程
【王树森】深度强化学习（DRL） · 深度强化学习 · 强化学习课程
《动手学大模型》系列编程实践教程 · 大模型实践 · 开源教程，适合系统入门大模型开发与实践

书籍

周志华《机器学习》 · 中文教材 · 机器学习基础
Dive into Deep Learning · 深度学习 · 可交互教材
Deep Learning · 深度学习经典教材 · Goodfellow / Bengio / Courville
Build a Large Language Model (From Scratch) · 大模型 · Sebastian Raschka / Manning，适合从零理解 GPT 风格大模型实现

科研工具

Google Scholar · 论文检索 · 常用入口
Semantic Scholar · 学术搜索 · 适合快速追踪引用
Zotero · 文献管理 · 文献整理与引用
CCF Deadline · 截稿日历 · 会议截稿查询
一份（不太）简短的 LATEX 2ε 介绍 · LaTeX · 中文入门手册

编程开发

Python Documentation · Python · 官方文档
PyTorch Docs · 深度学习框架 · 官方文档
Claude Code · AI 编程助手 · 适合日常代码生成、重构与命令行协作开发
Codex CLI · AI 编程助手 · 适合终端内代码生成、修改与任务执行
Git Documentation · 版本管理 · 协作开发基础
廖雪峰 Git 教程 · Git 教程 · 中文入门教程

开源工具

Hugging Face Docs · 模型与数据集 · 常用入口
LangChain Docs · LLM 应用开发 · Agent 与 RAG 常用工具