Repositories

yangjianxin1 repositories

27 supported repositories

Chinese NER (Named Entity Recognition) using BERT (Softmax, CRF, Span)

Last commit May 29, 2021

 (2 stars) (0 forks) (0 indexed issues) (0 open good first issues)

An improved version of PyTorch's DataParallel that balances GPU memory usage on the first GPU

Last commit Nov 5, 2020

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

Chinese CLIP pretrained model

Last commit Dec 5, 2022

 (419 stars) (60 forks) (0 indexed issues) (0 open good first issues)

Easy-to-use CPM for Chinese text generation

Last commit Jun 19, 2022

 (532 stars) (129 forks) (0 indexed issues) (0 open good first issues)

ChatGLM3 series: Open Bilingual Chat LLMs

Last commit Nov 20, 2023

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

Chinese LLaMA & Alpaca large language models, with local CPU deployment

Last commit Apr 10, 2023

 (6 stars) (0 forks) (0 indexed issues) (0 open good first issues)

Image captioning model based on ClipCap

Last commit Apr 1, 2022

 (324 stars) (44 forks) (0 indexed issues) (0 open good first issues)

Let us control diffusion models

Last commit Feb 16, 2023

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

Firefly: a training tool for large language models, supporting Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other models

Last commit Oct 24, 2024

 (6,638 stars) (585 forks) (0 indexed issues) (0 open good first issues)

Firefly Chinese LLaMA-2 large language model, supporting incremental pretraining of Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, Bloom, and other models

Last commit Oct 21, 2023

 (414 stars) (32 forks) (0 indexed issues) (0 open good first issues)

GPT2 for Chinese chitchat (implements the MMI idea from DialoGPT)

Last commit Apr 6, 2023

 (2,999 stars) (668 forks) (0 indexed issues) (0 open good first issues)

Chinese NER model based on lexical information fusion

Last commit Apr 2, 2022

 (172 stars) (19 forks) (0 indexed issues) (0 open good first issues)

Last commit Apr 6, 2023

 (310 stars) (22 forks) (0 indexed issues) (0 open good first issues)

LongQLoRA: Extend Context Length of LLMs Efficiently

Last commit Nov 11, 2023

 (169 stars) (15 forks) (0 indexed issues) (0 open good first issues)

Chinese OFA model built on the transformers architecture

Last commit Feb 13, 2023

 (138 stars) (16 forks) (0 indexed issues) (0 open good first issues)

Python implementation of the PAMAE algorithm (parallel k-medoids) from SIGKDD 2017

Last commit Jan 1, 2020

 (33 stars) (9 forks) (0 indexed issues) (0 open good first issues)

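QQ Music Spider, a Scrapy-based crawler that collects song information, lyrics, and featured comments, and shares a music corpus of more than 490,000 entries from the top 6,400 mainland Chinese and Hong Kong/Taiwan artists on QQ Music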

Last commit May 5, 2022

 (364 stars) (71 forks) (0 indexed issues) (0 open good first issues)

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Last commit Aug 5, 2024

 (1 star) (0 forks) (0 indexed issues) (0 open good first issues)

Contrastive learning for matching identical products on Shopee

Last commit Jan 29, 2022

 (17 stars) (2 forks) (0 indexed issues) (0 open good first issues)

Reproduction of the supervised and unsupervised SimCSE experiments

Last commit Apr 5, 2022

 (151 stars) (26 forks) (0 indexed issues) (0 open good first issues)