논문 읽기/Classification

Classification에서 MSE(mean square error) vs Cross-Entropy

AI 꿈나무 2022. 1. 12. 04:39

Evaluation of Neural Architectures Trained with Square Loss vs Cross-Entropy in Classification Tasks

https://arxiv.org/abs/2006.07322

Modern neural architectures for classification tasks are trained using the cross-entropy loss, which is widely believed to be empirically superior to the square loss. In this work we provide evidence indicating that this belief may not be well-founded. We

arxiv.org

ICLR 2021에 억셉된 페이퍼.

classification에서 cross-entropy를 사용하는 것은 standard이다.

왜 standard 일까? MSE를 사용해도 되지 않을까? 라는 의문에서 논문을 검색해봤고, 다행히도 classification task에서 MSE와 Cross-entropy를 비교하는 실험을 한 논문이 있었다.

다양한 task(NLP, Computer vision, Audio)와 다양한 dataset, 많은 모델에서 실험을 한다.

놀라운 점은 MSE를 사용했을 때가 성능이 높은 경우가 많다.

왜 그럴까?

이유에 대해 별다른 설명이 없다.

'논문 읽기 > Classification' 카테고리의 다른 글

[논문 읽기] Conformer(2021), Local Features Coupling Global Representations for Visual Recognition (0)	2021.09.15
[논문 읽기] CMT(2021), Convolutional Neural Networks Meet Vision Transformers (0)	2021.08.16
[논문 읽기] Scaling Vision Transformers(2021) (0)	2021.08.15
[논문 읽기] BoTNet(2021), Bottleneck Transformers for Visual Recognition (0)	2021.08.10
[논문 읽기] Early Convolutions Help Transformers See Better(2021) (0)	2021.08.09

현재글Classification에서 MSE(mean square error) vs Cross-Entropy

딥러닝 공부방

Classification에서 MSE(mean square error) vs Cross-Entropy

Evaluation of Neural Architectures Trained with Square Loss vs Cross-Entropy in Classification Tasks

'논문 읽기 > Classification' 카테고리의 다른 글

'논문 읽기/Classification'의 다른글

티스토리툴바

Classification에서 MSE(mean square error) vs Cross-Entropy

Evaluation of Neural Architectures Trained with Square Loss vs Cross-Entropy in Classification Tasks

'논문 읽기 > Classification' 카테고리의 다른 글

'논문 읽기/Classification'의 다른글

관련글

티스토리툴바