Repository

Repository di yangjianxin1

Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)

Ultimo commit 29 mag 2021

 (2 star) (0 fork) (0 issue indicizzate) (0 good first issue aperte)

这里是改进了pytorch的DataParallel, 用来平衡第一个GPU的显存使用量

Ultimo commit 5 nov 2020

 (0 star) (0 fork) (0 issue indicizzate) (0 good first issue aperte)

中文CLIP预训练模型

Ultimo commit 5 dic 2022

 (419 star) (60 fork) (0 issue indicizzate) (0 good first issue aperte)

Easy-to-use CPM for Chinese text generation(基于CPM的中文文本生成)

Ultimo commit 19 giu 2022

 (532 star) (129 fork) (0 issue indicizzate) (0 good first issue aperte)

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Ultimo commit 20 nov 2023

 (0 star) (0 fork) (0 issue indicizzate) (0 good first issue aperte)

中文LLaMA&Alpaca大语言模型+本地CPU部署 (Chinese LLaMA & Alpaca LLMs)

Ultimo commit 10 apr 2023

 (6 star) (0 fork) (0 issue indicizzate) (0 good first issue aperte)

基于ClipCap的看图说话Image Caption模型

Ultimo commit 1 apr 2022

 (324 star) (44 fork) (0 issue indicizzate) (0 good first issue aperte)

Let us control diffusion models

Ultimo commit 16 feb 2023

 (0 star) (0 fork) (0 issue indicizzate) (0 good first issue aperte)

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Ultimo commit 24 ott 2024

 (6638 star) (585 fork) (0 issue indicizzate) (0 good first issue aperte)

Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型

Ultimo commit 21 ott 2023

 (414 star) (32 fork) (0 issue indicizzate) (0 good first issue aperte)

GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型(实现了DialoGPT的MMI思想)

Ultimo commit 6 apr 2023

 (2999 star) (668 fork) (0 issue indicizzate) (0 good first issue aperte)

基于词汇信息融合的中文NER模型

Ultimo commit 2 apr 2022

 (172 star) (19 fork) (0 issue indicizzate) (0 good first issue aperte)

Ultimo commit 6 apr 2023

 (310 star) (22 fork) (0 issue indicizzate) (0 good first issue aperte)

LongQLoRA: Extent Context Length of LLMs Efficiently

Ultimo commit 11 nov 2023

 (169 star) (15 fork) (0 issue indicizzate) (0 good first issue aperte)

transformers结构的中文OFA模型

Ultimo commit 13 feb 2023

 (138 star) (16 fork) (0 issue indicizzate) (0 good first issue aperte)

使用Python复现SIGKDD2017的PAMAE算法(并行k-medoids算法)/The Python implementation of SIGKDD 2017's PAMAE algorithm (parallel k-medoids algorithm)

Ultimo commit 1 gen 2020

 (33 star) (9 fork) (0 issue indicizzate) (0 good first issue aperte)

基于Scrapy的QQ音乐爬虫(QQ Music Spider),爬取歌曲信息、歌词、精彩评论等,并且分享了QQ音乐中排名前6400名的内地和港台歌手的49万+的音乐语料

Ultimo commit 5 mag 2022

 (364 star) (71 fork) (0 issue indicizzate) (0 good first issue aperte)

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Ultimo commit 5 ago 2024

 (1 star) (0 fork) (0 issue indicizzate) (0 good first issue aperte)

对比学习 虾皮同款商品匹配

Ultimo commit 29 gen 2022

 (17 star) (2 fork) (0 issue indicizzate) (0 good first issue aperte)

SimCSE有监督与无监督实验复现

Ultimo commit 5 apr 2022

 (151 star) (26 fork) (0 issue indicizzate) (0 good first issue aperte)