Faiaz Hasanuzzaman Rhythm

Military Institute of Science and Technology

Email Google Scholar ResearchGate ORCID GitHub

Faiaz Hasanuzzaman Rhythm is a researcher and undergraduate student in the Department of Electrical, Electronic, and Communication Engineering at the Military Institute of Science and Technology (MIST), Bangladesh. His research primarily focuses on deep learning, computer vision, and medical image segmentation, with expanding interests in transformer-based architectures, attention mechanisms, and graph learning.

Over the past few years, he has developed and contributed to multiple research projects spanning medical imaging, remote sensing, and quantum-assisted feature learning. His notable works, including UAPNet, DoubleUNet++, and MAGnet, explore the integration of attention mechanisms, uncertainty modeling, and multiscale feature learning to achieve enhanced segmentation accuracy and interpretability. He has presented and published several papers in international conferences such as ECCE, ICAEEE, QPAIN, and TEHI, along with journal submissions to Elsevier and Wiley.

His undergraduate thesis, titled “Advancing Stacked U-Nets with Cross-Stage Attention for Precise Road Mapping in Remote Sensing,” represents a culmination of his work on encoder–decoder architectures and attention-based feature refinement. Beyond research, he has served as the Chair of IEEE MIST Student Branch, Ambassador of IEEE Bangladesh Section, and Campus Ambassador for Grameenphone Academy, demonstrating strong leadership and dedication to academic and community engagement.

He possesses hands-on experience in MATLAB, Python, TensorFlow, PyTorch, Django, and a range of simulation and CAD tools. Passionate about bridging technical research with practical applications, his long-term goal is to contribute to the advancement of intelligent vision systems capable of adapting across diverse real-world challenges in healthcare, robotics, and remote sensing.

Interests

Computer Vision
Computational Biology
Bioinformatics
Remote Sensing
Satellite Images
Explainable AI

Education

B.Sc in Electrical, Electronic and Communication Engineering — Military Institute of Science and Technology (2025)
Higher Secondary Certificate (HSC) — Saint Joseph Higher Secondary School (2020)
Secondary School Certificate (SSC) — Mohammadpur Preparatory School and College (2018)

Selected Publications

DoubleUNet++: Channel-Aware Gated Attention for Road Extraction in Satellite Imagery 2025

Faiaz Hasanuzzaman Rhythm, Nomaiya Bashree, Tareque Bashar Ovi, Hussain Nyeem, Md Abdul Wahed

2025 International Conference on Quantum Photonics, Artificial Intelligence, and Networking (QPAIN)

Paper link

BibTeX

Click to copy

@INPROCEEDINGS{11172088,
  author={Rhythm, Faiaz Hasanuzzaman and Bashree, Nomaiya and Ovi, Tareque Bashar and Nyeem, Hussain and Wahed, Md Abdul},
  booktitle={2025 International Conference on Quantum Photonics, Artificial Intelligence, and Networking (QPAIN)}, 
  title={DoubleUNet++: Channel-Aware Gated Attention for Road Extraction in Satellite Imagery}, 
  year={2025},
  volume={},
  number={},
  pages={1-6},
  keywords={Image segmentation;Roads;Vegetation mapping;Computer architecture;Logic gates;Feature extraction;Data models;Spatial databases;Satellite images;Topology;Deep learning;Road extraction;Cascaded model;Segmentation;Gated attention;Skip connection;Squeeze-excitation;High-resolution satellite image},
  doi={10.1109/QPAIN66474.2025.11172088}}

Enhancing U2Net for Precise Road Extraction from Satellite Images via Channel Refinement 2025

Faiaz Hasanuzzaman Rhythm, Tareque Bashar Ovi, Nomaiya Bashree, Hussain Nyeem, Md Abdul Wahed

2025 International Conference on Quantum Photonics, Artificial Intelligence, and Networking (QPAIN)

Paper link

BibTeX

Click to copy

@INPROCEEDINGS{11171628,
  author={Rhythm, Faiaz Hasanuzzaman and Ovi, Tareque Bashar and Bashree, Nomaiya and Nyeem, Hussain and Wahed, Md Abdul},
  booktitle={2025 International Conference on Quantum Photonics, Artificial Intelligence, and Networking (QPAIN)}, 
  title={Enhancing U2Net for Precise Road Extraction from Satellite Images via Channel Refinement}, 
  year={2025},
  volume={},
  number={},
  pages={1-6},
  keywords={Deep learning;Image segmentation;Adaptation models;Roads;Computational modeling;Urban areas;Computer architecture;Satellite images;Computational efficiency;Context modeling;Road extraction;Segmentation;Skip connection;Squeeze-excitation;High-resolution satellite image},
  doi={10.1109/QPAIN66474.2025.11171628}}

No images

YESnet: YOLOv11 Enabled SAM-2 Framework for Memory-Efficient Skin Lesion Segmentation 2025

Nomaiya Bashree, Tareque Bashar Ovi, Sadia Binte Zahid, Faiaz Hasanuzzaman Rhythm, Hussain Nyeem, Md Abdul Wahed

2025 International Conference on Quantum Photonics, Artificial Intelligence, and Networking (QPAIN)

Paper link

BibTeX

Click to copy

@INPROCEEDINGS{11171738,
  author={Bashree, Nomaiya and Ovi, Tareque Bashar and Zahid, Sadia Binte and Rhythm, Faiaz Hasanuzzaman and Nyeem, Hussain and Abdul, Md},
  booktitle={2025 International Conference on Quantum Photonics, Artificial Intelligence, and Networking (QPAIN)}, 
  title={YESnet: YOLOv11 Enabled SAM-2 Framework for Memory-Efficient Skin Lesion Segmentation}, 
  year={2025},
  volume={},
  number={},
  pages={1-5},
  keywords={Image segmentation;Accuracy;Image analysis;Pipelines;Redundancy;Memory management;Detectors;Skin;Lesions;Photonics;Deep learning;skin lesion segmentation;foundational model;nano YOLOv11 architecture;SAM-2},
  doi={10.1109/QPAIN66474.2025.11171738}}

Bridging Classical and Quantum Models via Attention-Guided Feature Distillation 2025

Tareque Bashar Ovi, Nomaiya Bashree, Faiaz Hasanuzzaman Rhythm, Disha Chowdhury, Hussain Nyeem, Md Abdul Wahed

2025 2nd International Conference on Next-Generation Computing, IoT and Machine Learning (NCIM)

Paper link

BibTeX

Click to copy

@INPROCEEDINGS{11159974,
  author={Ovi, Tareque Bashar and Bashree, Nomaiya and Rhythm, Faiaz Hasanuzzaman and Chowdhury, Disha and Nyeem, Hussain and Wahed, Md Abdul},
  booktitle={2025 2nd International Conference on Next-Generation Computing, IoT and Machine Learning (NCIM)}, 
  title={Bridging Classical and Quantum Models via Attention-Guided Feature Distillation}, 
  year={2025},
  volume={},
  number={},
  pages={1-6},
  keywords={Training;Accuracy;Qubit;Transfer learning;Neural networks;Computer architecture;Logic gates;Quantum networks;Next generation networking;Image classification;Deep learning;knowledge distillation;amplitude encoding;transfer learning;QML},
  doi={10.1109/NCIM65934.2025.11159974}}

Optimizing Monocular Depth Estimation through Bi-Level Nested Architecture Integration 2025

Faiaz Hasanuzzaman Rhythm, Tareque Bashar Ovi, Nomaiya Bashree, Md. Raisul Islam Ratul, Hussain Nyeem, Md Abdul Wahed

2025 International Conference on Electrical, Computer and Communication Engineering (ECCE)

Paper link

BibTeX

Click to copy

@INPROCEEDINGS{11013114,
  author={Rhythm, Faiaz Hasanuzzaman and Bashar Ovi, Tareque and Bashree, Nomaiya and Islam Ratul, Md. Raisul and Nyeem, Hussain and Wahed, Md Abdul},
  booktitle={2025 International Conference on Electrical, Computer and Communication Engineering (ECCE)}, 
  title={Optimizing Monocular Depth Estimation through Bi-Level Nested Architecture Integration}, 
  year={2025},
  volume={},
  number={},
  pages={1-6},
  keywords={Deep learning;Adaptation models;Visualization;Depth measurement;Computational modeling;Reliability engineering;Robustness;Real-time systems;Computational efficiency;Robots;deep learning;depth estimation;contextual and local information;residual u-block;nested u-structure},
  doi={10.1109/ECCE64574.2025.11013114}}

Utilizing Reverse Attention for Enhanced Mitochondria Segmentation in Microscopic Images 2025

Ayat Subah Alam, Faiaz Hasanuzzaman Rhythm, Tareque Bashar Ovi, Nomaiya Bashree, Hussain Nyeem, Md Abdul Wahed

2025 International Conference on Electrical, Computer and Communication Engineering (ECCE)

Paper link

BibTeX

Click to copy

@INPROCEEDINGS{11013970,
  author={Alam, Ayat Subah and Rhythm, Faiaz Hasanuzzaman and Ovi, Tareque Bashar and Bashree, Nomaiya and Nyeem, Hussain and Wahed, Md Abdul},
  booktitle={2025 International Conference on Electrical, Computer and Communication Engineering (ECCE)}, 
  title={Utilizing Reverse Attention for Enhanced Mitochondria Segmentation in Microscopic Images}, 
  year={2025},
  volume={},
  number={},
  pages={1-6},
  keywords={Image segmentation;Adaptation models;Mitochondria;Accuracy;Image color analysis;Microscopy;Computer architecture;Transformers;Real-time systems;Biomedical imaging;deep learning;medical image segmentation;contextual information;mitochondria segmentation;electron-microscopy images;reverse attention},
  doi={10.1109/ECCE64574.2025.11013970}}

A Novel Dual Attention Approach for DNN Based Automated Diabetic Retinopathy Grading 2024

Tareque Bashar Ovi, Nomaiya Bashree, Hussain Nyeem, Md Abdul Wahed, Faiaz Hasanuzzaman Rhythm, Ayat Subah Alam

International Journal of Imaging Systems and Technology

Paper link

BibTeX

Click to copy

@article{https://doi.org/10.1002/ima.23175,
author = {Ovi, Tareque Bashar and Bashree, Nomaiya and Nyeem, Hussain and Wahed, Md Abdul and Rhythm, Faiaz Hasanuzzaman and Alam, Ayat Subah},
title = {A Novel Dual Attention Approach for DNN Based Automated Diabetic Retinopathy Grading},
journal = {International Journal of Imaging Systems and Technology},
volume = {34},
number = {5},
pages = {e23175},
doi = {https://doi.org/10.1002/ima.23175},
url = {https://onlinelibrary.wiley.com/doi/abs/10.1002/ima.23175},
eprint = {https://onlinelibrary.wiley.com/doi/pdf/10.1002/ima.23175},
note = {e23175 IMA-24-504.R1},
abstract = {ABSTRACT Diabetic retinopathy (DR) poses a serious threat to vision, emphasising the need for early detection. Manual analysis of fundus images, though common, is error-prone and time-intensive. Existing automated diagnostic methods lack precision, particularly in the early stages of DR. This paper introduces the Soft Convolutional Block Attention Module-based Network (Soft-CBAMNet), a deep learning network designed for severity detection, which features Soft-CBAM attention to capture complex features from fundus images. The proposed network integrates both the convolutional block attention module (CBAM) and the soft-attention components, ensuring simultaneous processing of input features. Following this, attention maps undergo a max-pooling operation, and refined features are concatenated before passing through a dropout layer with a dropout rate of 50\%. Experimental results on the APTOS dataset demonstrate the superior performance of Soft-CBAMNet, achieving an accuracy of 85.4\% in multiclass DR grading. The proposed architecture has shown strong robustness and general feature learning capability, achieving a mean AUC of 0.81 on the IDRID dataset. Soft-CBAMNet's dynamic feature extraction capability across all classes is further justified by the inspection of intermediate feature maps. The model excels in identifying all stages of DR with increased precision, surpassing contemporary approaches. Soft-CBAMNet presents a significant advancement in DR diagnosis, offering improved accuracy and efficiency for timely intervention.},
year = {2024}
}

A Transformer-based Text-Guided Approach for Improved Colonoscopic Polyp Segmentation 2024

Tareque Bashar Ovi, Nomaiya Bashree, Hussain Nyeem, Md Abdul Wahed, Ayat Subah Alam, Faiaz Hasanuzzaman Rhythm

2024 3rd International Conference on Advancement in Electrical and Electronic Engineering (ICAEEE)

Paper link

BibTeX

Click to copy

@INPROCEEDINGS{10561827,
  author={Ovi, Tareque Bashar and Bashree, Nomaiya and Nyeem, Hussain and Wahed, Md Abdul and Alam, Ayat Subah and Rhythm, Faiaz Hasanuzzaman},
  booktitle={2024 3rd International Conference on Advancement in Electrical and Electronic Engineering (ICAEEE)}, 
  title={A Transformer-based Text-Guided Approach for Improved Colonoscopic Polyp Segmentation}, 
  year={2024},
  volume={},
  number={},
  pages={1-6},
  keywords={Training;Deep learning;Image segmentation;Image coding;Colonic polyps;Predictive models;Transformers;deep learning;medical image segmentation;contextual information;text guided attention;transformer},
  doi={10.1109/ICAEEE62219.2024.10561827}}

🧪 Ongoing Research Projects

UAPNet: Uncertainty Augmented Pyramid Vision Transformer Network With Efficient Channel Filtering for Polyp Segmentation The 5th International Conference on Trends in Electronics and Health Informatics (TEHI 2025) , 2025

Objective

To mitigate feature redundancy in deep networks, we integrate the Efficient Channel Attention (ECA) module with the PVT encoder and augment it with an Uncertainty Augmented Context Attention (UACA) mechanism.

Key Findings

PVT with Uncertainty-Augmented Context Attention. We integrate the PVT encoder with an uncertainty-augmented context attention mechanism, enabling the model to focus explicitly on ambiguous boundary regions. This mechanism improves the accuracy of boundary delineation by using uncertainty as a guiding signal.
Integrating ECA with PVT Based Encoder. We incorporate ECA after each PVT stage, which refines feature representations while maintaining the original dimensionality. This enhancement improves the model’s discriminative power without introducing additional computational overhead.
Explainability for Clinical Reliability. To ensure transparency and clinical applicability, we apply explainable AI (XAI) techniques, including uncertainty maps and heatmaps for foreground and edge regions. These methods improve interpretability and provide clinicians with more reliable decision making tools by highlighting areas of uncertainty and focus during polyp segmentation.

Photos

Performance Analysis of Semi-Supervised Frameworks for Polyp Segmentation The 5th International Conference on Trends in Electronics and Health Informatics (TEHI 2025) , 2025

Investigating Adversarial Resilience of Popular Models For Polyp And Skin Lesion Segmentation The 5th International Conference on Trends in Electronics and Health Informatics (TEHI 2025) , 2025

Comparative Insight of Scalable Graph Based Spatial Domain Discovery From Stereo-Seq and Slide-Seq Data Taylor and Francis Book Chapter, 3rd International Conference on Big Data, IoT and Machine Learning (BIM 2025) , 2025

Objective

To systematically evaluate the performance and scalability of leading GNN-based spatial transcriptomics (ST) models—STAGATE, GraphST, and SEDR—on high-density next-generation datasets such as Stereo-seq and Slide-seq, and to identify their limitations in modeling complex spatial structures.

Key Findings

Performance Evaluation: Comprehensive assessment using ARI, AMI, NMI, and HOMO metrics revealed modest and variable clustering performance across high-density datasets.
- SEDR achieved the highest scores on Slide-seq datasets, but overall accuracy remained low.
- Stereo-seq results were inconsistent, with no single model consistently outperforming others.
Scalability Limitation: Existing GNN-based ST models struggle to generalize to high-resolution data, indicating poor scalability to large, complex tissue structures.
Benchmark Establishment: This study provides a performance baseline for evaluating future models on next-generation ST platforms.
Research Implication: Highlights the urgent need for developing new computational frameworks that are more robust, adaptive, and scalable for analyzing dense spatial transcriptomics data.

Photos

Ensemble Based Graph Attention Auto-Encoder Architecture For Unsupervised Spatial Clustering Taylor and Francis Book Chapter, 3rd International Conference on Big Data, IoT and Machine Learning (BIM 2025) , 2025

Objective

To overcome the limitation of fixed spatial neighborhood scales in existing spatial transcriptomics (ST) models by developing En-STAGATE, an ensemble framework that integrates multiple spatial graphs across varying neighborhood radii to capture both local and global tissue structures for improved domain identification.

Key Findings

Multi-Scale Ensemble Design: En-STAGATE constructs and fuses representations from multiple spatial graphs at different neighborhood radii, effectively modeling both fine-grained cellular interactions and large-scale tissue organization.
Enhanced Biological Interpretability: The multi-scale integration provides a more comprehensive and biologically meaningful latent representation of tissues.
Empirical Validation: Achieved competitive or superior performance across 14 benchmark datasets, outperforming STAGATE and other leading methods in several key cases.
Improved Domain Delineation: Notably higher Adjusted Rand Index (ARI) scores on datasets such as 151674, 151675, 151676, and Mouse, demonstrating robustness in complex tissue structures.
Key Insight: Highlights that a single neighborhood scale is insufficient for spatial domain identification, establishing multi-scale analysis as a critical paradigm in spatial transcriptomics.

Photos

Proximity Enhanced Multi-Modal Graph Based Spatial Transcriptomics Clustering Taylor and Francis Book Chapter, 3rd International Conference on Big Data, IoT and Machine Learning (BIM 2025) , 2025

Objective

To address the over-smoothing and spatial bias in current spatial transcriptomics (ST) models by developing GraphST++, a multi-modal framework that integrates spatial, gene expression, and histological features for constructing biologically faithful tissue graphs.

Key Findings

Multi-Modal Graph Construction: GraphST++ defines inter-spot relationships using a composite score combining spatial distance, gene expression similarity, and histological morphology, reducing over-reliance on proximity-based connections.
Improved Tissue Representation: The proposed approach captures complex biological interactions and produces a more accurate reflection of the tissue microenvironment.
Superior Performance: Demonstrated higher Adjusted Rand Index (ARI) compared to leading models, including GraphST, across multiple benchmark datasets such as human breast cancer (BRCA) and human brain tissues.
Robustness Across Datasets: While not universally dominant, GraphST++ consistently benefits from morphological integration, yielding more stable and biologically interpretable clustering results.
Broader Implications: Highlights the importance of multi-modal fusion in advancing spatial transcriptomics analysis and enabling more precise tissue domain identification.

Photos

Transformer Enhanced Graph Based Spatial Domain Identification Taylor and Francis Book Chapter, 3rd International Conference on Big Data, IoT and Machine Learning (BIM 2025) , 2025

Objective

To overcome the limitation of existing graph-based spatial transcriptomics (ST) models in capturing long-range spatial dependencies by developing STAGATE++, a hybrid GAT–Transformer architecture that jointly learns local and global tissue representations.

Key Findings

Hybrid Architecture Advantage: STAGATE++ integrates a Graph Attention Network (GAT) for precise local feature extraction with a Transformer encoder for modeling long-range spatial dependencies.
Enhanced Spatial Awareness: The model constructs a holistic and spatially coherent tissue representation, improving understanding of tissue organization.
Empirical Superiority: Demonstrated state-of-the-art performance across 14 benchmark datasets, consistently outperforming existing methods in spatial domain identification.
Validation Metric: Significant improvements in Adjusted Rand Index (ARI) confirm the importance of global context modeling in tissue delineation.
Future Extensions: Plans include incorporating histopathological image features, applying self-supervised training for better generalization, and integrating GNN interpretability techniques for biological insight and clinical transparency.

Photos

Experiences

Work Experiences

2024 Industrial Training 🏛 Mango Teleservices Ltd.

2024 Industrial Training 🏛 Grameenphone Ltd.

Professional Affiliation

2024-2025 Campus Ambassador 🏛 Grameenphone Academy

2024-2025 Ambassador 🏛 EEE Bangladesh Section

2024-2025 Chairperson 🏛 IEEE MIST Student Branch

2023-2024 ICEEICT 2024 Website Development & Maintenance 🏛 Military Institute of Science and Technology

Link

2024 Instructor: Workshop on LATEX 🏛 IEEE MIST Student Branch

2023-2024 Team Leader: ML & Micro-Controller 🏛 MIST Innovation Club

2024 Campus Ambassador 🏛 Student Professional Awareness Conference (SPAC)

Link

Profile