Posts by Collection

portfolio

Gesture Agent

An agent system that interprets and responds to gesture inputs using computer vision.

publications

Method, System, Medium, and Electronic Device for Optimizing Fault-Tolerant Neural Network Structure

Published in China National Intellectual Property Administration

A method for optimizing fault-tolerant neural network architectures using Bayesian optimization, addressing resistance variation and bit-flip in ReRAM devices.

Recommended citation: N. Ye, Z. Fang, J. Mei. "Method, System, Medium, and Electronic Device for Optimizing Fault-Tolerant Neural Network Structure." CN Patent CN113,570,056 A, 2021.

BayesFT: Bayesian Optimization for Fault Tolerant Neural Network Architecture

Published in DAC 2021 (Design Automation Conference)

We propose BayesFT, a Bayesian optimization framework for designing fault-tolerant neural network architectures. Our approach efficiently searches the architecture space to find models that maintain performance under hardware faults, achieving state-of-the-art results on fault-tolerant neural network design.

Recommended citation: N. Ye*, J. Mei*, Z. Fang, Y. Zhang, Z. Zhang, H. Wu, X. Liang. "BayesFT: Bayesian Optimization for Fault Tolerant Neural Network Architecture." DAC 2021.

Fine-Grained Late-Interaction Multi-Modal Retrieval for Retrieval Augmented Visual Question Answering

Published in NeurIPS 2023

We propose FLMR, a fine-grained late-interaction multi-modal retrieval model for retrieval-augmented visual question answering. FLMR extends the ColBERT architecture to handle multi-modal queries and documents, enabling effective retrieval for knowledge-intensive VQA tasks.

Recommended citation: W. Lin, J. Chen, J. Mei, A. Coca, B. Byrne. "Fine-Grained Late-Interaction Multi-Modal Retrieval for Retrieval Augmented Visual Question Answering." NeurIPS 2023.
Download Paper

Apparatus and Method for Streaming Automatic Speech Recognition

Published in European Patent Office

A novel non-attention-based transformer architecture for efficient on-device streaming automatic speech recognition, achieving a 50% reduction in model size and computation along with a 60% decrease in end-to-end latency.

Recommended citation: J. Mei, Z. Zhang. "Apparatus and Method for Streaming Automatic Speech Recognition." EP Patent EP4404187A1, 2024.

Control-DAG: Constrained Decoding for Non-Autoregressive Directed Acyclic T5 using Weighted Finite State Automata

Published in NAACL 2024 Main

We introduce Control-DAG, a constrained decoding method for non-autoregressive Directed Acyclic T5 using weighted finite state automata. This enables efficient constrained generation while maintaining the speed advantages of non-autoregressive models.

Recommended citation: J. Chen, W. Lin, J. Mei, B. Byrne. "Control-DAG: Constrained Decoding for Non-Autoregressive Directed Acyclic T5 using Weighted Finite State Automata." NAACL 2024 Main.
Download Paper

PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-Modal Retrievers

Published in ACL 2024 Main

We introduce PreFLMR, a large-scale pre-trained fine-grained late-interaction multi-modal retriever. By scaling up both the model and pre-training data, PreFLMR achieves state-of-the-art performance on a wide range of multi-modal retrieval benchmarks.

Recommended citation: W. Lin*, J. Mei*, J. Chen*, B. Byrne. "PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-Modal Retrievers." ACL 2024 Main.
Download Paper

Improving Hateful Meme Detection through Retrieval-Guided Contrastive Learning

Published in ACL 2024 Main

We propose RGCL, a retrieval-guided contrastive learning framework for hateful meme detection. By retrieving similar memes and using them as contrastive examples, our model learns more discriminative representations for distinguishing hateful from benign memes.

Recommended citation: J. Mei, J. Chen, W. Lin, B. Byrne, M. Tomalin. "Improving Hateful Meme Detection through Retrieval-Guided Contrastive Learning." ACL 2024 Main.
Download Paper

Robust Adaptation of Large Multimodal Models for Retrieval Augmented Hateful Meme Detection

Published in EMNLP 2025 Main (Oral)

We propose a robust adaptation approach for large multimodal models in retrieval-augmented hateful meme detection. Our method effectively leverages retrieved examples to improve detection performance while maintaining robustness against noisy retrievals.

Recommended citation: J. Mei, J. Chen, G. Yang, W. Lin, B. Byrne. "Robust Adaptation of Large Multimodal Models for Retrieval Augmented Hateful Meme Detection." EMNLP 2025 Main (Oral).

On Extending Direct Preference Optimization to Accommodate Ties

Published in NeurIPS 2025

We extend Direct Preference Optimization (DPO) to handle tied preferences, where annotators cannot distinguish between two responses. Our method provides a principled framework for incorporating ties into preference-based alignment of language models.

Recommended citation: J. Chen, G. Yang, W. Lin, J. Mei, B. Byrne. "On Extending Direct Preference Optimization to Accommodate Ties." NeurIPS 2025.

ExPO-HM: Learning to Explain-then-Detect for Hateful Meme Detection

Published in ICLR 2026

We propose ExPO-HM, a novel explain-then-detect framework for hateful meme detection. By first generating explanations of why a meme might be hateful and then using these explanations to guide detection, our model achieves improved performance and interpretability.

Recommended citation: J. Mei, M. Sun, J. Chen, P. Qin, Y. Li, D. Chen, B. Byrne. "ExPO-HM: Learning to Explain-then-Detect for Hateful Meme Detection." ICLR 2026.

According to Me: Long-Term Personalized Referential Memory QA

Published in arXiv preprint

We introduce ATM-Bench, a benchmark for long-term personalized referential memory question answering. Our work addresses the challenge of building AI systems that can maintain and reason over personalized memory across long-term interactions.

Recommended citation: J. Mei, J. Chen, G. Yang, X. Hou, M. Li, B. Byrne. "According to Me: Long-Term Personalized Referential Memory QA." arXiv preprint arXiv:2603.01990.
Download Paper

Retrieval-Augmented Defense: Adaptive and Controllable Jailbreak Prevention for Large Language Models

Published in ACL 2026 Main

We introduce RAD, a retrieval-augmented defense mechanism for preventing jailbreak attacks on large language models. Our approach adaptively retrieves relevant safety guidelines to provide controllable and effective jailbreak prevention.

Recommended citation: G. Yang, J. Chen, J. Mei, W. Lin, B. Byrne. "Retrieval-Augmented Defense: Adaptive and Controllable Jailbreak Prevention for Large Language Models." ACL 2026 Main.
Download Paper

talks

teaching