Group Members: Mu Cai, Yunyu (Bella) Bai, Xuechun Yang
Source code and dataset available at: https://github.com/mu-cai/cs766_21spring
Related Work
The Traditional Way to Train a Model: Empirical Risk Minimization (ERM)
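As a rough illustration, ERM simply minimizes the average training loss over all samples, ignoring which group each sample comes from. Below is a minimal PyTorch-style sketch of one training step; the model, optimizer, and batch here are hypothetical placeholders, not our exact implementation:

```python
import torch
import torch.nn.functional as F

def erm_step(model, optimizer, batch):
    """One ERM update: minimize the *average* loss over the batch,
    with no notion of group membership."""
    x, y = batch  # group labels are not used by ERM
    optimizer.zero_grad()
    logits = model(x)
    loss = F.cross_entropy(logits, y)  # mean over all samples
    loss.backward()
    optimizer.step()
    return loss.item()
```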

State-of-the-Art Method for Reducing Group Shifts: Distributionally Robust Optimization (DRO)
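In contrast, group DRO optimizes the worst-group objective, min_θ max_g E[ℓ(θ; x, y) | group g], rather than the average loss. The sketch below is a simplified variant that backpropagates the worst-group loss within each mini-batch; the full group-DRO algorithm of Sagawa et al. additionally maintains exponentially-updated group weights, but the core idea is the same. It assumes each batch carries a tensor of group indices g (names hypothetical):

```python
import torch
import torch.nn.functional as F

def group_dro_step(model, optimizer, batch, num_groups):
    """One (simplified) group-DRO update: back-propagate the loss of
    the worst-performing group instead of the average loss."""
    x, y, g = batch  # g[i] is the group index of sample i
    optimizer.zero_grad()
    logits = model(x)
    per_sample = F.cross_entropy(logits, y, reduction='none')
    group_losses = []
    for k in range(num_groups):
        mask = (g == k)
        if mask.any():  # skip groups absent from this batch
            group_losses.append(per_sample[mask].mean())
    worst = torch.stack(group_losses).max()  # worst-group loss
    worst.backward()
    optimizer.step()
    return worst.item()
```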

The Performance of the Above Two Methods
Even though DRO focuses on minimizing the worst-group loss, the test accuracy on the minority groups still falls far short of that of the majority group!

Current Limitation 1: Synthesized Dataset
The current community uses synthesized data to construct group-shifted datasets, which does not reflect the distribution of natural images and limits real-world applications.
As shown in the picture below, such a dataset is constructed by simply pasting a foreground object onto a background image.
To facilitate research on group shifts in the community, we collect a large-scale natural-image dataset via a web crawler, as sketched below.
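A minimal sketch of this kind of crawling script, written here with the open-source icrawler library; the keywords and directory names are illustrative placeholders, not our exact collection pipeline:

```python
from icrawler.builtin import BingImageCrawler

# Illustrative keywords; a real pipeline would use a curated keyword
# list covering each (class, background) group.
keywords = ['waterbird on land', 'landbird on water']

for kw in keywords:
    crawler = BingImageCrawler(
        storage={'root_dir': f'data/{kw.replace(" ", "_")}'})
    crawler.crawl(keyword=kw, max_num=500)  # download up to 500 images
```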

Current Limitation 2: OOD Dataset
Besides, the current research community does not consider robustness to out-of-distribution (OOD) samples. Real-world test images span a wide range of distributions. Therefore, determining whether a test image belongs to the in-distribution set is critical, yet this has not been studied in this setting. Here we study the robustness of neural network models on four diverse high-resolution out-of-distribution datasets.
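As one concrete example of what "determining whether a test image is in-distribution" can look like, the sketch below scores images with the maximum softmax probability (MSP) baseline of Hendrycks & Gimpel and flags low-confidence inputs as OOD. The threshold value is purely illustrative; in practice it is chosen on held-out validation data:

```python
import torch
import torch.nn.functional as F

@torch.no_grad()
def msp_scores(model, x):
    """Maximum-softmax-probability OOD score: higher means the
    input looks more in-distribution to the model."""
    probs = F.softmax(model(x), dim=1)
    return probs.max(dim=1).values

def flag_ood(model, x, threshold=0.5):
    # threshold=0.5 is a hypothetical value; tune on validation data
    scores = msp_scores(model, x)
    return scores < threshold  # True => likely out-of-distribution
```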
