---
layout: hub_detail
background-class: hub-background
body-class: hub
title: "Once-for-All"
category: researchers
summary: Once-for-all (OFA) decouples training and search, and achieves efficient inference across various edge devices and resource constraints.
image: once_for_all_overview.png
author: MIT Han Lab
tags: [vision, scriptable]
github-link: https://github.com/mit-han-lab/once-for-all
github-id: mit-han-lab/once-for-all
featured_image_1: once_for_all_overview.png
featured_image_2: no-image
accelerator: cuda-optional
---

### Get supernet

You can quickly load a supernet as follows:

```python
import torch

super_net_name = "ofa_supernet_resnet50"
# other options:
# ofa_mbv3_d234_e346_k357_w1.0
# ofa_mbv3_d234_e346_k357_w1.2
# ofa_proxyless_d234_e346_k357_w1.3

super_net = torch.hub.load('mit-han-lab/once-for-all', super_net_name, pretrained=True).eval()
```
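
`torch.hub.load` caches a copy of the repository on first use. If you hit a stale cache (for example after an upstream update), you can force a fresh download; this is standard `torch.hub` behavior, not OFA-specific:

```python
# Re-download the repository and weights instead of using the local hub cache.
super_net = torch.hub.load('mit-han-lab/once-for-all', super_net_name,
                           pretrained=True, force_reload=True).eval()
```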

| OFA Network | Design Space | Resolution | Width Multiplier | Depth | Expand Ratio | Kernel Size |
|-------------|--------------|------------|------------------|-------|--------------|-------------|
| ofa_resnet50 | ResNet50D | 128 - 224 | 0.65, 0.8, 1.0 | 0, 1, 2 | 0.2, 0.25, 0.35 | 3 |
| ofa_mbv3_d234_e346_k357_w1.0 | MobileNetV3 | 128 - 224 | 1.0 | 2, 3, 4 | 3, 4, 6 | 3, 5, 7 |
| ofa_mbv3_d234_e346_k357_w1.2 | MobileNetV3 | 160 - 224 | 1.2 | 2, 3, 4 | 3, 4, 6 | 3, 5, 7 |
| ofa_proxyless_d234_e346_k357_w1.3 | ProxylessNAS | 128 - 224 | 1.3 | 2, 3, 4 | 3, 4, 6 | 3, 5, 7 |

Below is an example of sampling / selecting a subnet from the supernet:

```python
# Randomly sample a sub-network from the OFA network
super_net.sample_active_subnet()
random_subnet = super_net.get_active_subnet(preserve_weight=True)

# Manually set the sub-network
super_net.set_active_subnet(ks=7, e=6, d=4)
manual_subnet = super_net.get_active_subnet(preserve_weight=True)
```

### Get Specialized Architecture

```python
import torch

# Or load an architecture specialized for a certain platform.
net_config = "resnet50D_MAC_4_1B"
# other options: additional specialized resnet50D configurations at different
# MAC budgets; see the model-zoo link below for the full list of names

specialized_net = torch.hub.load('mit-han-lab/once-for-all', net_config, pretrained=True).eval()
```
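
The specialized networks are exposed as ordinary `torch.hub` entry points, so you can enumerate every available name directly; `torch.hub.list` is standard PyTorch functionality:

```python
import torch

# Print all entry points exposed by the repository's hubconf.py.
print(torch.hub.list('mit-han-lab/once-for-all'))
```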

More models and configurations can be found in [once-for-all/model-zoo](https://github.com/mit-han-lab/once-for-all#evaluate-1)
and can be obtained through the following script:

```python
import torch

ofa_specialized_get = torch.hub.load('mit-han-lab/once-for-all', "ofa_specialized_get")
model = ofa_specialized_get("flops@595M_top1@80.0_finetune@75", pretrained=True)
```

The model's predictions can be evaluated as follows:
```python
# Download an example image from the pytorch website
import urllib.request
url, filename = ("https://github.com/pytorch/hub/raw/master/images/dog.jpg", "dog.jpg")
urllib.request.urlretrieve(url, filename)

# sample execution (requires torchvision)
from PIL import Image
from torchvision import transforms
input_image = Image.open(filename)
preprocess = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])
input_tensor = preprocess(input_image)
input_batch = input_tensor.unsqueeze(0)  # create a mini-batch as expected by the model

# move the input and model to GPU for speed if available
if torch.cuda.is_available():
    input_batch = input_batch.to('cuda')
    model.to('cuda')

with torch.no_grad():
    output = model(input_batch)
# Tensor of shape (1000,), with unnormalized confidence scores over ImageNet's 1000 classes
print(output[0])
# Run a softmax on the scores to get probabilities.
probabilities = torch.nn.functional.softmax(output[0], dim=0)
print(probabilities)
```
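
To make the probabilities human-readable, you can map them to class names. A short sketch using the ImageNet label file published alongside the PyTorch Hub examples (the URL and file name follow other hub pages):

```python
# Download the ImageNet class labels and print the top-5 predictions.
import urllib.request
urllib.request.urlretrieve(
    "https://raw.githubusercontent.com/pytorch/hub/master/imagenet_classes.txt",
    "imagenet_classes.txt",
)
with open("imagenet_classes.txt") as f:
    categories = [line.strip() for line in f]

top5_prob, top5_id = torch.topk(probabilities, 5)
for prob, idx in zip(top5_prob, top5_id):
    print(categories[idx], f"{prob.item():.4f}")
```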

### Model Description

Once-for-all models are from [Once for All: Train One Network and Specialize it for Efficient Deployment](https://arxiv.org/abs/1908.09791). Conventional approaches either manually design or use neural architecture search (NAS) to find a specialized neural network and train it from scratch for each case, which is computationally prohibitive (emitting as much CO2 as five cars over their lifetimes) and therefore unscalable. In this work, we propose to train a once-for-all (OFA) network that supports diverse architectural settings by decoupling training and search. Across diverse edge devices, OFA consistently outperforms state-of-the-art (SOTA) NAS methods (up to 4.0% ImageNet top-1 accuracy improvement over MobileNetV3, or the same accuracy but 1.5x faster than MobileNetV3 and 2.6x faster than EfficientNet w.r.t. measured latency), while reducing GPU hours and CO2 emission by orders of magnitude. In particular, OFA achieves a new SOTA 80.0% ImageNet top-1 accuracy under the mobile setting (<600M MACs).

<!-- ![](images/ofa_imagenet_results.png) -->
<img src="https://github.com/mit-han-lab/once-for-all/raw/master/figures/cnn_imagenet_new.png" width="100%"/>

### References

```
@inproceedings{cai2020once,
  title={Once for All: Train One Network and Specialize it for Efficient Deployment},
  author={Han Cai and Chuang Gan and Tianzhe Wang and Zhekai Zhang and Song Han},
  booktitle={International Conference on Learning Representations},
  year={2020},
  url={https://arxiv.org/pdf/1908.09791.pdf}
}
```