FluxML/FastAI.jl

Dataset recipes

Open

#153 opened on Aug 10, 2021

good first issue, help wanted

Description

With #151, FastAI.jl is getting high-level interfaces for searching datasets (finddatasets) and loading datasets into task-specific data containers (loaddataset). There is also a new DatasetRecipe that encapsulates configuration for loading a data container and the block information from a path. These recipes can be registered with a dataset so that they can be found using the above high-level functions.
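
To make the idea of registering recipes with a dataset concrete, here is a minimal, self-contained sketch. It does not use FastAI.jl and is not its implementation: ToyRecipe, RECIPES, register!, findrecipes and loadwith are hypothetical names invented for illustration, and blocks are modelled as plain symbols rather than real block types.

```julia
# Hypothetical stand-ins for illustration only; not FastAI.jl's actual types or functions.
struct ToyRecipe
    description::String
    blocks::Tuple{Vararg{Symbol}}   # block information, e.g. (:Image, :Label)
    load::Function                  # maps a dataset path to a data container
end

# Registry: dataset name => recipes registered for that dataset.
const RECIPES = Dict{String, Vector{ToyRecipe}}()

# Register a recipe under a dataset name so it can be found later.
function register!(name::AbstractString, recipe::ToyRecipe)
    push!(get!(RECIPES, name, ToyRecipe[]), recipe)
    return recipe
end

# Names of all datasets that have at least one recipe producing `blocks`.
function findrecipes(blocks::Tuple)
    return sort(String[name for (name, rs) in RECIPES if any(r -> r.blocks == blocks, rs)])
end

# Load a data container through the first recipe matching `blocks`.
function loadwith(name::AbstractString, blocks::Tuple, path::AbstractString)
    rs = get(RECIPES, name, ToyRecipe[])
    i = findfirst(r -> r.blocks == blocks, rs)
    i === nothing && error("no recipe for $name with blocks $blocks")
    return rs[i].load(path), rs[i].blocks
end

# Toy loaders return hard-coded (file, label) observations instead of reading from disk.
register!("mnist_tiny", ToyRecipe("image classification", (:Image, :Label),
    path -> [(joinpath(path, "3", "1.png"), "3")]))
register!("pascal_2007", ToyRecipe("multi-label image classification", (:Image, :LabelMulti),
    path -> [(joinpath(path, "000001.jpg"), ["dog", "person"])]))

println(findrecipes((:Image, :Label)))
data, blocks = loadwith("pascal_2007", (:Image, :LabelMulti), "data")
println(blocks)
```

In this sketch a dataset can carry several recipes, one per task, which mirrors how the list below gives more than one task for a dataset such as pascal_2007.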

The fastai dataset collection contains a large number of datasets, and only a few of them have recipes so far. This issue tracks the progress on adding recipes to all the datasets. Contributions of recipe types and recipe configs for datasets are welcome.

See src/datasets/recipes.jl for example recipe implementations and src/datasets/fastairegistry for how recipes are registered. listdatasources() gives you a list of all dataset sources and datasetpath(name) downloads them and returns the download folder.

Progress

For datasets that can be used for multiple tasks, the tasks are listed beneath the dataset. Otherwise, a checked dataset means that at least one recipe is already implemented.

  • CUB_200_2011
  • bedroom (not sure how the folders are laid out)
  • caltech_101
  • cifar10
  • cifar100
  • food-101
  • imagenette-160
  • imagenette-320
  • imagenette
  • imagenette2-160
  • imagenette2-320
  • imagenette2
  • imagewang-160
  • imagewang-320
  • imagewang
  • imagewoof-160
  • imagewoof-320
  • imagewoof
  • imagewoof2-160
  • imagewoof2-320
  • imagewoof2
  • mnist_png
  • mnist_var_size_tiny
  • oxford-102-flowers
  • oxford-iiit-pet
  • stanford-cars
  • ag_news_csv
  • amazon_review_full_csv
  • amazon_review_polarity_csv
  • dbpedia_csv
  • giga-fren
  • imdb
  • sogou_news_csv
  • wikitext-103
  • wikitext-2
  • yahoo_answers_csv
  • yelp_review_full_csv
  • yelp_review_polarity_csv
  • biwi_head_pose
  • camvid
  • pascal-voc
  • pascal_2007
    • multi-label image classification ((Image{2}, LabelMulti))
    • object detection
  • pascal_2012
  • siim_small
  • skin-lesion
  • tcga-small
  • adult_sample
  • biwi_sample
  • camvid_tiny
  • dogscats
  • human_numbers
  • imdb_sample
  • mnist_sample
  • mnist_tiny
  • movie_lens_sample
  • planet_sample
  • planet_tiny
  • coco_sample
  • coco-train2017
  • coco-val2017
  • coco-test2017
  • coco-unlabeled2017
  • coco-image_info_test2017
  • coco-image_info_unlabeled2017
  • coco-annotations_trainval2017
  • coco-stuff_annotations_trainval2017
  • coco-panoptic_annotations_trainval2017
