永雏多氢菲の書库

Literature 105 CS 19 Science 13 Paper 3 Uncategorized 3 Philosophy 1

Pinned

1970-01-01

#No Tags

Cover Image of the Post

芥川龍之介: 片恋

2026-06-19

单相思

Cover Image of the Post

村上春樹: 羊をめぐる冒険第二章 1978/7月

2026-06-16

寻羊历险记

Cover Image of the Post

村上春樹: 羊をめぐる冒険第一章 1970/11/25

2026-06-16

寻羊历险记

Cover Image of the Post

芥川龍之介: 運

2026-06-15

运气

Cover Image of the Post

RoBERTa 的源码 [LLM]

2026-06-15

RoBERTa (Robustly optimized BERT approach) 是对 BERT 的一种改进。作者认为原版 BERT 存在训练不足的问题，在不改变模型架构的基础上针对预训练过程做出了改进：移除了 SNP 任务，改变 BERT 的 MLM 方法，对训练数据使用动态掩码策略。

#No Tags

Cover Image of the Post

BERT 模型的介绍和应用 [LLM]

2026-06-14

标准的 BERT 模型是一个双向 Transformer encoder，通过两个自监督任务进行训练：掩码语言模型和下一句预测。整体的训练损失是这两个任务损失的和：

#No Tags

Cover Image of the Post

芥川龍之介: 偸盗

2026-06-13

偷盗

Cover Image of the Post

使用 Beamer 实现一个简单的 ppt [LaTeX]

2026-06-12

本节介绍 Beamer 的一些常用功能，教程来自官方文档:

#No Tags

Cover Image of the Post

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

2026-06-12

语言模型预训练已被证实能够有效提升多项 NLP 任务的效果。将预训练语言表征应用于下游任务主要有两种主流方式：基于特征的方法与微调方法。以 ELMo 为代表的特征式方法，会设计专属任务架构，并把预训练表征当作额外特征使用；而以 GPT 为代表的微调式方法，仅引入少量任务专属参数，直接对全部预训练参数进行微调以适配下游任务。两种方法的预训练目标一致，均采用单向语言模型学习通用语言表征。

#No Tags

Cover Image of the Post

永雏多氢菲

∴さて····どこへ行こうか？

随缘分享喵

あ行か行さ行た行な行ま行哲学生物学轻小说

Posts

144

Categories

6

Tags

9

Total Words

2,255,454

Running Days

0 days

Last Activity

0 days ago

Table of Contents