A shared document for the Academic Rising Star Program (学术新星计划) 2020 sharing session
Repository
Repository of Xiao9905
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
The repository contains an ongoing collection of tweet IDs associated with the novel coronavirus COVID-19 (SARS-CoV-2); collection commenced on January 28, 2020.
Transformer-related optimization, including BERT and GPT
GLM (General Language Model)
GLM-130B: An Open Bilingual Pre-Trained Model
GraphMAE: Self-supervised Masked Graph Autoencoders
DGL tutorial in KDD 2019
Must-read papers on knowledge graph reasoning
Source code and dataset for KDD 2019 paper "OAG: Toward Linking Large-scale Heterogeneous Entity Graphs"
Code for OAG_know and GloMoCo: Unsupervised Embedding Training for Concept Linking
An optimized prompt tuning strategy comparable to fine-tuning across model scales and tasks.
Must-read papers on prompt-based tuning for pre-trained language models.
Guide to courses in the Department of Computer Science and Technology, Tsinghua University
SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).