K M Naimul Hassan — Portfolio

Latest

News & Updates

June 2026 SCANS — accepted to Interspeech 2026! Accepted

May 2026 MAESTRO — A Multimodal Auditory-attention Egocentric Speech-TRacking Open corpus — submitted to IEEE TASLP. Submitted

March 2026 SCANS — Supervised Contrastive temporal Alignment of Neural response and Speech stimuli — submitted to Interspeech 2026. Under Review

Apr 2025 Received the CCBS Summer Graduate Research Award 2025 from the Center for Cognitive and Brain Sciences, OSU. Award

Nov 2024 Received the IEEE Signal Processing Society (SPS) Scholarship 2024. Award

Aug 2023 Started Ph.D. in Computer Science & Engineering at The Ohio State University as a Graduate Research Associate in the ASPIRE Group. New

Jul 2023 Successfully defended M.Sc. thesis: "Medical Sound Event Detection Using Audio Spectrogram Fourier Network" at BUET.

Focus Areas

Research Interests

🧠

Brain-Computer Interface

Decoding cognitive states from neural signals for real-world assistive systems

🤖

Neuro AI

Bridging neuroscience and deep learning to model auditory attention and perception

🎙️

Audio & Speech Processing

Signal processing and deep learning for speech understanding and sound event detection

👁️

Multimodal Learning

Cross-modal representation learning across EEG, speech, gaze, and video

🏥

AI for Healthcare

Privacy-preserving clinical audio AI and accessible assistive technologies

🎧

Speech Perception

Neural correlates of auditory attention in naturalistic listening environments

Academic Background

Education

Aug 2023
Present

Doctor of Philosophy (Ph.D.)

The Ohio State University

Department of Computer Science & Engineering · Columbus, Ohio, USA

Research focus on EEG-based auditory attention decoding, contrastive learning for neural-speech alignment, and multimodal BCI systems. Advisor: Prof. Donald Williamson, ASPIRE Group.

Jul 2021
Jul 2023

Master of Science (M.Sc.)

Bangladesh University of Engineering and Technology (BUET)

Department of Electrical & Electronic Engineering · Dhaka, Bangladesh

Thesis: Medical Sound Event Detection Using Audio Spectrogram Fourier Network. Designed an attention-free transformer using FFT-based sublayers achieving significant improvements over Audio Spectrogram Transformer.

Feb 2016
Feb 2021

Bachelor of Science (B.Sc.)

Bangladesh University of Engineering and Technology (BUET)

Department of Electrical & Electronic Engineering · Dhaka, Bangladesh

Work History

Experience

Aug 2023
Present

Graduate Research Associate

The Ohio State University — ASPIRE Group

Columbus, Ohio, USA

Developing a reinforcement learning framework for real-time auditory attention decoding to control neuro-steered hearing aids.
Collecting and analyzing synchronized multimodal brain and acoustic data to study neural correlates of speech attention.
Developing a contrastive learning framework (SCANS) to align brain signals with speech stimuli.

Jul 2021
Jul 2023

Research Assistant

Bangladesh University of Engineering and Technology (BUET)

Department of EEE · Dhaka, Bangladesh

Built a privacy-preserving cough detection pipeline using audio source separation (Wave-U-Net).
Designed an efficient attention-free transformer for medical sound event detection.

Research Output

Publications

NeuroAI

Accepted

SCANS: Supervised Contrastive temporal Alignment of Neural response and Speech stimuli

K. M. N. Hassan and D. Williamson

Interspeech 2026

Submitted

MAESTRO: A Multimodal Auditory-attention Egocentric Speech-TRacking Open corpus

K. M. N. Hassan, Seyed Ali Alavi, and D. Williamson

IEEE Transactions on Audio, Speech, and Language Processing, 2026

AI in Healthcare & Accessibility

Published

SS+CEDNet: A Speech Privacy Aware Cough Detection Pipeline by Separating Sources

K. M. N. Hassan and M. A. Haque

IEEE R10 Humanitarian Technology Conference (R10-HTC), 2022

Code Paper

Published

ALSNet: A Dilated 1-D CNN for Identifying ALS from Raw EMG Signal

K. M. N. Hassan et al.

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022

Code Paper

Published

A Dual-Purpose Refreshable Braille Display Based on Real Time Object Detection and OCR

K. M. N. Hassan, S. K. Biswas, M. S. Anwar, M. S. Iman Siam, and C. Shahnaz

IEEE SPICSCON, 2019

Code Paper

Audio & Speech Processing

Published

DOANet: A Deep Dilated CNN Approach for Search and Rescue with Drone-Embedded Sound Source Localization

A. B. A. Qayyum, K. M. N. Hassan, A. Anika et al.

EURASIP Journal on Audio, Speech, and Music Processing, 2020

Code Paper

Published

Direction of Arrival Estimation through Noise Suppression: A Novel Approach using GSC Beamforming

A. B. A. Qayyum, A. Anika, M. M. M. Miah, M. M. Rahman, K. M. N. Hassan et al.

IEEE SPICSCON, 2019

Paper

Selected Work

Projects

PhD Research

Belief-State RL for EEG-Based Auditory Attention Detection

A POMDP-based reinforcement learning system that decodes auditory attention from EEG brain signals for real-time, uncertainty-aware control of neuro-steered hearing aids.

EEGRLPOMDPBCI

PhD Research

MAESTRO: Multimodal Speech Attention Dataset

Collected and synchronized hundreds of hours of multimodal EEG, eye gaze, head motion, audio, and video data to identify neural correlates of speech attention.

EEGEye GazeDatasetMultimodal

PhD Research · Interspeech 2026

SCANS: Neural-Speech Contrastive Alignment

Supervised contrastive learning framework using dilated convolutions and cross-modal attention to temporally align EEG signals with speech stimuli in naturalistic listening.

Contrastive LearningEEGSpeech

MSc Research · IEEE R10-HTC 2022

SS+CEDNet: Privacy-Preserving Cough Detection

Pipeline using Wave-U-Net for audio source separation to enable privacy-preserving cough detection from ambient audio, improving accuracy while protecting speech privacy.

AudioHealthcareSource Sep.

MSc Research · IEEE ICASSP 2022

ALSNet: ALS Detection from Raw EMG

Dilated 1D CNN for end-to-end identification of ALS from raw EMG signals without hand-crafted feature extraction. Achieved 97.74% overall accuracy.

EMGClinical AICNN

IEEE SP Cup 2020 · 2nd Runner-up

Unsupervised Anomaly Detection in Multimodal Autonomous Systems

LSTM autoencoder for IMU sensor signals and convolutional autoencoder on optical-flow features for video, with a parametric anomaly score fusion strategy.

Anomaly DetectionLSTMIMU

IEEE SP Cup 2019 · World Rank 10

DOANet: Drone-Embedded Sound Source Localization

Deep dilated CNN estimating direction of arrival from multi-channel audio on a UAV, enabling drone-based search and rescue without hand-crafted features or ego-noise reduction.

DOADroneMicrophone Array

IEEE YESIST12 2019 · National Champion

Refreshable Braille Display with Real-Time Object Detection

Dual-purpose assistive device for visually impaired users with real-time object detection and OCR, integrated with a refreshable Braille display for portable reading and environmental awareness.

AccessibilityOCRYOLO

Amazon Alexa Prize 2022

Intelligent Dialog Management for Social Bots

Modular conversational agent with NLU, dialog management, and response generation, integrating intent/entity recognition, sentiment modeling, and neural response generation.

NLPDialogAlexa

Recognition

Honors & Awards

🏆

CCBS Summer Graduate Research Award (2025)

Center for Cognitive and Brain Sciences · The Ohio State University

🎖️

IEEE Signal Processing Society (SPS) Scholarship (2024)

IEEE Signal Processing Society

🏅

CSE Scarlet and Gray Award (2023 – Present)

Department of Computer Science & Engineering · The Ohio State University

🥉

2nd Runner-up · IEEE Signal Processing Cup 2020

Unsupervised Anomaly Detection in Multimodal Autonomous Systems · ICASSP 2020, Barcelona

🥈

1st Runner-up · IEEE Video and Image Processing Cup 2019

Privacy-aware Office Activity Recognition from FPV Body Cameras · ICIP 2019, Taipei

🌍

National Champion & World Finalist · IEEE YESIST12 Innovation Challenge 2019

Refreshable Braille Display · Final at Stamford University, Hua Hin, Thailand

🌐

World Rank 10 · IEEE Signal Processing Cup 2019

Search & Rescue with Drone-Embedded Sound Source Localization

🎓

Post-Graduate Fellowship (M.Sc.) (2021 – 2023)

Department of EEE · Bangladesh University of Engineering and Technology

K M NaimulHassan

News & Updates

Research Interests

Brain-Computer Interface

Neuro AI

Audio & Speech Processing

Multimodal Learning

AI for Healthcare

Speech Perception

Education

Doctor of Philosophy (Ph.D.)

Master of Science (M.Sc.)

Bachelor of Science (B.Sc.)

Experience

Graduate Research Associate

Research Assistant

Publications

Projects

Belief-State RL for EEG-Based Auditory Attention Detection

MAESTRO: Multimodal Speech Attention Dataset

SCANS: Neural-Speech Contrastive Alignment

SS+CEDNet: Privacy-Preserving Cough Detection

ALSNet: ALS Detection from Raw EMG

Unsupervised Anomaly Detection in Multimodal Autonomous Systems

DOANet: Drone-Embedded Sound Source Localization

Refreshable Braille Display with Real-Time Object Detection

Intelligent Dialog Management for Social Bots

Honors & Awards

CCBS Summer Graduate Research Award (2025)

IEEE Signal Processing Society (SPS) Scholarship (2024)

CSE Scarlet and Gray Award (2023 – Present)

2nd Runner-up · IEEE Signal Processing Cup 2020

1st Runner-up · IEEE Video and Image Processing Cup 2019

National Champion & World Finalist · IEEE YESIST12 Innovation Challenge 2019

World Rank 10 · IEEE Signal Processing Cup 2019

Post-Graduate Fellowship (M.Sc.) (2021 – 2023)

Skills

Programming Languages

ML / DL Frameworks

Hardware & IoT

Other

Contact

K M Naimul
Hassan