[Paper Review] Invariant Information Clustering for Unsupervised Image Classification and Segmentation(2018)

논문 읽기/Self-Supervised

[Paper Review] Invariant Information Clustering for Unsupervised Image Classification and Segmentation(2018)

AI 꿈나무 2021. 7. 30. 12:47

Invariant Information Clustering for Unsupervised Image Classification and Segmentation

Xu Ji, Joao F.Henriques, Andrea Vedaldi, arXiv 2018

PDF, Clustering By SeonghoonYu July 30th, 2021

Summary

This paper presents IIC model which acieves SOTA performance on Image clustering and Image segmentation by maximizing the mutual information between the original image and the transformed image from original image.

Optional overclustering is to learn useful representations from the noisy unlabelled subset.

(1) Image Clustering

maximizing the mutual information between encoded variables.

Calculate the joint probability distributions matrix P, where each element at row c and column c' constitutes $P_{cc'} = P(z=c,z'=c')$

symetrize

(2) Image Segmentation

They maximize the mutual information between image patches on image segmentation task.

(3) Degenerate solutions are avoided

Avoid degenerate solutions that other clustering methods are susceptible to by using the mutual information.

Maximizing I(z,z') trade-off minimizing the conditional cluster assignment entropy H(zlz') and maximising individual cluster assignments entropy H(z). The smallest value of H(zlz') is 0, obtained when the cluster assignments are exactly predictable from each other. The largest value of H(z) is lnC, obtained when all clusters are equally likely to be picked. This occurs when the data is assigned evenly between the clusters, equalizing their mass.

simese 기반 모델인데 contrastive loss라는게 기본적으로 P(y=i l x', x_i) 형태로 나온다. 어떤 말이냐면 negative set에서 affine transform된 자신을 찾을 수 있어야 로스값이 내려가는 구조. 이게 H(z;z')가 최소값.

H(z)이 최대값이 되려면 모든 cluster가 균일하게 pick되는 경우인데, 이는 p(z)가 uniform 하다는 것을 의미함. p(z)이 유니폼할때 H(z) = 평균(log(p(z)))에서 최대값이 되나?

Experiment

What I like about the paper

avoid degenerate problems that other clustering methods are susceptible to by using the mutual information
interesting solving unsupervised image segmentation task using clustering

my github about what i read

Seonghoon-Yu/Paper_Review_and_Implementation_in_PyTorch

공부 목적으로 논문을 리뷰하고 해당 논문 파이토치 재구현을 합니다. Contribute to Seonghoon-Yu/Paper_Review_and_Implementation_in_PyTorch development by creating an account on GitHub.

github.com

'논문 읽기 > Self-Supervised' 카테고리의 다른 글

[논문 읽기] Contrastive Multiview Coding (0)	2021.08.06
[Paper Review] Rotation(2018), Unsupervised Representation Learning by Pre-diction Image Rotations (0)	2021.08.04
[Paper Review] SimCLRv2(2020), Big Self-Supervised Models are Strong Semi-Supervised Learners (0)	2021.07.26
[Paper Review] Unsupervised Learning of Visual Representations using Videos(2015) (0)	2021.07.23
[Paper Review] Unsupervised Feature Learning via Non-Parametric Instance Discrimination(2018) (0)	2021.07.22

현재글[Paper Review] Invariant Information Clustering for Unsupervised Image Classification and Segmentation(2018)

딥러닝 공부방

[Paper Review] Invariant Information Clustering for Unsupervised Image Classification and Segmentation(2018)

Invariant Information Clustering for Unsupervised Image Classification and Segmentation