ExPO-HM: Learning to Explain-then-Detect for Hateful Meme Detection

Published in ICLR 2026

We propose ExPO-HM, a novel explain-then-detect framework for hateful meme detection. By first generating explanations of why a meme might be hateful and then using these explanations to guide detection, our model achieves improved performance and interpretability.

Code

Recommended citation: J. Mei, M. Sun, J. Chen, P. Qin, Y. Li, D. Chen, B. Byrne. "ExPO-HM: Learning to Explain-then-Detect for Hateful Meme Detection." ICLR 2026.

Share on

Bluesky Facebook LinkedIn Mastodon X (formerly Twitter)

Jingbiao Mei

Share on