Help Wantednever-stale
Description
This is a list of tasks not yet in ParlAI that would be great to have. Feel free to add more to the list also! We will remove individual items when they are done.
Chit Chat
- DailyDialog https://arxiv.org/abs/1710.03957
- Datasets in decaNLP that are missing: https://github.com/salesforce/decaNLP
- CoLA https://nyu-mll.github.io/CoLA/
- Movie Discussions with Knowledge: https://arxiv.org/pdf/1809.08205.pdf
- MultiWoz : https://arxiv.org/abs/1810.00278
- Video stories?: https://research.fb.com/wp-content/uploads/2018/10/A-Dataset-for-Telling-the-Stories-of-Social-Media-Videos.pdf?
- AirDialogue http://www.aclweb.org/anthology/D18-1419
- Movie chat with background knowledge: http://aclweb.org/anthology/D18-1255,
- Movie chat with Wikipedia grounding: http://aclweb.org/anthology/D18-1076, https://github.com/festvox/datasets-CMU_DoG
- Craiglist bargain http://aclweb.org/anthology/D18-1256
- Datasets from DSTC7 http://alborz-geramifard.com/workshops/nips18-Conversational-AI/Papers/18convai-DSTC7.pdf
- Movie recommendation: https://www.microsoft.com/en-us/research/uploads/prod/2018/11/deep_conversational_recommendations__1_1.pdf
- Redial dataset: https://redialdata.github.io/website/
- OTTers https://arxiv.org/pdf/2105.13710.pdf
Knowledge-grounded datasets:
- Conversational reading (https://arxiv.org/pdf/1906.02738.pdf)
- Knowledge Dataset from DSTC7 https://github.com/DSTC-MSR-NLP/DSTC7-End-to-End-Conversation-Modeling/tree/master/data_extraction
- Holl-E (https://github.com/nikitacs16/Holl-E, https://arxiv.org/abs/1809.08205)
- OpenDialKG (https://github.com/facebookresearch/opendialkg)
Visual Dialogue / QA Tasks / Captioning:
- KVQA http://dosa.cds.iisc.ac.in/kvqa-2/01/mishra_CR.pdf (see paper for links to other VQA too)
- GQA (VQA-type) dataset https://cs.stanford.edu/people/dorarad/gqa/
- Visual Storytelling https://arxiv.org/pdf/1604.03968.pdf
- Multimodal shopping dialogue (with images) https://arxiv.org/pdf/1704.00200.pdf
- Visual Commonsense reasoning https://visualcommonsense.com/
- Netizen-Style Commenting on Fashion Photos, https://mashyu.github.io/NSC/
- Conceptual Captions https://github.com/google-research-datasets/conceptual-captions
QA Tasks:
- Natural Questions: https://ai.google/research/pubs/pub47761
- HotpotQA: https://hotpotqa.github.io/
- SearchQA https://github.com/nyu-dl/SearchQA
- Who Did What (cloze qa): https://arxiv.org/abs/1608.05457
- NewsQA https://datasets.maluuba.com/NewsQA
- QuAC: https://arxiv.org/pdf/1808.07036.pdf
- CoQA (#1674): https://arxiv.org/abs/1808.07042
- DREAM Dialogue QA https://arxiv.org/pdf/1902.00164.pdf
- DROP https://arxiv.org/abs/1903.00161
- Common Sense from ConceptNet https://arxiv.org/pdf/1811.00937.pdf
- AmazonQA: http://jmcauley.ucsd.edu/data/amazon/qa/