blog: initial sharding (ishard)
Co-authored-by: Alex Aizman <[email protected]>
Signed-off-by: Alex Aizman <[email protected]>
Nahemah1022 and alex-aizman committed Aug 16, 2024
1 parent 18dc104 commit 35bca58
Showing 7 changed files with 424 additions and 0 deletions.
387 changes: 387 additions & 0 deletions docs/_posts/2024-08-16-ishard.md

Large diffs are not rendered by default.

Binary file added docs/assets/ishard/ishard-base_file_name.png
Binary file added docs/assets/ishard/ishard-full_name.png
Binary file added docs/assets/ishard/ishard-ml-buckets.gif
Binary file added docs/assets/ishard/ishard-throughput-plot.png
Binary file added docs/assets/ishard/ishard_workflow.gif
37 changes: 37 additions & 0 deletions docs/examples/ishard-imagenet/download.sh
@@ -0,0 +1,37 @@
# Download all training data from the official ImageNet website
# This takes approximately 200 GB of disk space
nohup wget -P "$IMAGENET_HOME" https://image-net.org/data/ILSVRC/2012/ILSVRC2012_img_train.tar
mkdir -p "$IMAGENET_HOME/train" && tar xf "$IMAGENET_HOME/ILSVRC2012_img_train.tar" -C "$IMAGENET_HOME/train"
cd "$IMAGENET_HOME/train"
for f in *.tar; do
  d=$(basename "$f" .tar)
  mkdir -p "$d"
  tar xf "$f" -C "$d"
  rm "$f"
done

# Prepare validation data
wget -P "$IMAGENET_HOME" https://image-net.org/data/ILSVRC/2012/ILSVRC2012_img_val.tar
mkdir -p "$IMAGENET_HOME/validation" && tar xf "$IMAGENET_HOME/ILSVRC2012_img_val.tar" -C "$IMAGENET_HOME/validation"

# Prepare features
wget -P "$IMAGENET_HOME" https://image-net.org/data/ILSVRC/2010/ILSVRC2010_feature_sbow_train.tar
mkdir -p "$IMAGENET_HOME/label" && tar xf "$IMAGENET_HOME/ILSVRC2010_feature_sbow_train.tar" -C "$IMAGENET_HOME/label"

# Prepare annotations
wget -P "$IMAGENET_HOME" https://image-net.org/data/ILSVRC/2012/ILSVRC2012_bbox_train_v2.tar.gz
mkdir -p "$IMAGENET_HOME/train_annotation" && tar xf "$IMAGENET_HOME/ILSVRC2012_bbox_train_v2.tar.gz" -C "$IMAGENET_HOME/train_annotation"

wget -P "$IMAGENET_HOME" https://image-net.org/data/ILSVRC/2012/ILSVRC2012_bbox_val_v3.tgz
mkdir -p "$IMAGENET_HOME/validation_annotation" && tar xf "$IMAGENET_HOME/ILSVRC2012_bbox_val_v3.tgz" -C "$IMAGENET_HOME/validation_annotation"

cd "$IMAGENET_HOME/annotation"
for f in *.tar.gz; do
  d=$(basename "$f" .tar.gz)
  mkdir -p "$d"
  tar xf "$f" -C "$d"
  rm "$f"
done

# Remove original tar files to free up disk space
cd "$IMAGENET_HOME" && rm -f ILSVRC2012_bbox_train_v2.tar.gz ILSVRC2012_bbox_val_v3.tgz ILSVRC2010_feature_sbow_train.tar ILSVRC2012_img_val.tar ILSVRC2012_img_train.tar nohup.out
