open-guides/og-aws

S3: Discuss ways to list and tally objects efficiently

Open

#58 建立於 2016年9月2日

在 GitHub 查看
 (9 留言) (0 反應) (0 負責人)Shell (36,412 star) (3,905 fork)batch import
help wantedunder discussion

描述

Topics:

  • Listing and pagination
  • Need for multi-threaded S3 crawl over keys for speed
    • Prefix-based listings, with separators
    • Hash-type prefixes with known alphabet, uniform distribution
  • Possibly: Reassigning work; using markers to optimize if alphabet is not known
  • Tallying usage by mapreduce over keys that propagate usage up by folder

https://docs.aws.amazon.com/AmazonS3/latest/API/RESTBucketGET.html

貢獻者指南

S3: Discuss ways to list and tally objects efficiently · open-guides/og-aws#58 | Good First Issue