Tokdash - Token Usage Dashboard
A dashboard for monitoring and visualizing token usage across LLM API calls.
A dashboard for monitoring and visualizing token usage across LLM API calls.
A flood warning system for the UK, built with Python for real-time monitoring of flood risk data.
An agent system that interprets and responds to gesture inputs using computer vision.
Comprehensive notes and materials for the Cambridge Engineering 3F3 Statistical Signal Processing course.
Course notes for Cambridge Engineering 3F7 covering information theory, source coding, and channel coding.
Data analysis project exploring correlations between gene expression levels and tumor characteristics.
Exercises and examples for learning concurrent programming patterns in C++.
Implementation and experiments with blue noise sampling algorithms.
Published in China National Intellectual Property Administration
A method for optimizing fault-tolerant neural network architectures using Bayesian optimization, addressing resistance variation and bit-flip in ReRAM devices.
Recommended citation: N. Ye, Z. Fang, J. Mei. "Method, System, Medium, and Electronic Device for Optimizing Fault-Tolerant Neural Network Structure." CN Patent CN113,570,056 A, 2021.
Published in DAC 2021 (Design Automation Conference)
We propose BayesFT, a Bayesian optimization framework for designing fault-tolerant neural network architectures. Our approach efficiently searches the architecture space to find models that maintain performance under hardware faults, achieving state-of-the-art results on fault-tolerant neural network design.
Recommended citation: N. Ye*, J. Mei*, Z. Fang, Y. Zhang, Z. Zhang, H. Wu, X. Liang. "BayesFT: Bayesian Optimization for Fault Tolerant Neural Network Architecture." DAC 2021.
Published in NeurIPS 2023
We propose FLMR, a fine-grained late-interaction multi-modal retrieval model for retrieval-augmented visual question answering. FLMR extends the ColBERT architecture to handle multi-modal queries and documents, enabling effective retrieval for knowledge-intensive VQA tasks.
Recommended citation: W. Lin, J. Chen, J. Mei, A. Coca, B. Byrne. "Fine-Grained Late-Interaction Multi-Modal Retrieval for Retrieval Augmented Visual Question Answering." NeurIPS 2023.
Download Paper
Published in European Patent Office
A novel non-attention-based transformer architecture for efficient on-device streaming automatic speech recognition, achieving a 50% reduction in model size and computation along with a 60% decrease in end-to-end latency.
Recommended citation: J. Mei, Z. Zhang. "Apparatus and Method for Streaming Automatic Speech Recognition." EP Patent EP4404187A1, 2024.
Published in NAACL 2024 Main
We introduce Control-DAG, a constrained decoding method for non-autoregressive Directed Acyclic T5 using weighted finite state automata. This enables efficient constrained generation while maintaining the speed advantages of non-autoregressive models.
Recommended citation: J. Chen, W. Lin, J. Mei, B. Byrne. "Control-DAG: Constrained Decoding for Non-Autoregressive Directed Acyclic T5 using Weighted Finite State Automata." NAACL 2024 Main.
Download Paper
Published in ACL 2024 Main
We introduce PreFLMR, a large-scale pre-trained fine-grained late-interaction multi-modal retriever. By scaling up both the model and pre-training data, PreFLMR achieves state-of-the-art performance on a wide range of multi-modal retrieval benchmarks.
Recommended citation: W. Lin*, J. Mei*, J. Chen*, B. Byrne. "PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-Modal Retrievers." ACL 2024 Main.
Download Paper
Published in ACL 2024 Main
We propose RGCL, a retrieval-guided contrastive learning framework for hateful meme detection. By retrieving similar memes and using them as contrastive examples, our model learns more discriminative representations for distinguishing hateful from benign memes.
Recommended citation: J. Mei, J. Chen, W. Lin, B. Byrne, M. Tomalin. "Improving Hateful Meme Detection through Retrieval-Guided Contrastive Learning." ACL 2024 Main.
Download Paper
Published in EMNLP 2025 Main (Oral)
We propose a robust adaptation approach for large multimodal models in retrieval-augmented hateful meme detection. Our method effectively leverages retrieved examples to improve detection performance while maintaining robustness against noisy retrievals.
Recommended citation: J. Mei, J. Chen, G. Yang, W. Lin, B. Byrne. "Robust Adaptation of Large Multimodal Models for Retrieval Augmented Hateful Meme Detection." EMNLP 2025 Main (Oral).
Published in NeurIPS 2025
We extend Direct Preference Optimization (DPO) to handle tied preferences, where annotators cannot distinguish between two responses. Our method provides a principled framework for incorporating ties into preference-based alignment of language models.
Recommended citation: J. Chen, G. Yang, W. Lin, J. Mei, B. Byrne. "On Extending Direct Preference Optimization to Accommodate Ties." NeurIPS 2025.
Published in ICLR 2026
We propose ExPO-HM, a novel explain-then-detect framework for hateful meme detection. By first generating explanations of why a meme might be hateful and then using these explanations to guide detection, our model achieves improved performance and interpretability.
Recommended citation: J. Mei, M. Sun, J. Chen, P. Qin, Y. Li, D. Chen, B. Byrne. "ExPO-HM: Learning to Explain-then-Detect for Hateful Meme Detection." ICLR 2026.
Published in arXiv preprint
We introduce ATM-Bench, a benchmark for long-term personalized referential memory question answering. Our work addresses the challenge of building AI systems that can maintain and reason over personalized memory across long-term interactions.
Recommended citation: J. Mei, J. Chen, G. Yang, X. Hou, M. Li, B. Byrne. "According to Me: Long-Term Personalized Referential Memory QA." arXiv preprint arXiv:2603.01990.
Download Paper
Published in arXiv preprint
A novel approach for controllable multi-label video safety detection using adaptive Tversky policy optimization, enabling fine-grained safety classification of video content.
Recommended citation: Controllable Multi-label Video Safety Detection via Adaptive Tversky Policy Optimization.
Published in ACL 2026 Main
We introduce RAD, a retrieval-augmented defense mechanism for preventing jailbreak attacks on large language models. Our approach adaptively retrieves relevant safety guidelines to provide controllable and effective jailbreak prevention.
Recommended citation: G. Yang, J. Chen, J. Mei, W. Lin, B. Byrne. "Retrieval-Augmented Defense: Adaptive and Controllable Jailbreak Prevention for Large Language Models." ACL 2026 Main.
Download Paper