2025-06-26 |
StruMamba3D: Exploring Structural Mamba for Self-supervised Point Cloud Representation Learning |
Chuxin Wang et.al. |
2506.21541 |
null |
2025-06-25 |
EAGLE: An Efficient Global Attention Lesion Segmentation Model for Hepatic Echinococcosis |
Jiayan Chen et.al. |
2506.20333 |
null |
2025-06-24 |
FlightKooba: A Fast Interpretable FTP Model |
Jing Lu et.al. |
2506.19885 |
null |
2025-06-24 |
MambaOutRS: A Hybrid CNN-Fourier Architecture for Remote Sensing Image Classification |
Minjong Cheon et.al. |
2506.19561 |
null |
2025-06-24 |
AMF-MedIT: An Efficient Align-Modulation-Fusion Framework for Medical Image-Tabular Data |
Congjing Yu et.al. |
2506.19439 |
null |
2025-06-24 |
JCAPT: A Joint Modeling Approach for CAPT |
Tzu-Hsuan Yang et.al. |
2506.19315 |
null |
2025-06-24 |
3D-SSM: A Novel 3D Selective Scan Module for Remote Sensing Change Detection |
Rui Huang et.al. |
2506.19263 |
null |
2025-06-23 |
Diffusion Transformer-to-Mamba Distillation for High-Resolution Image Generation |
Yuan Yao et.al. |
2506.18999 |
null |
2025-06-22 |
Damba-ST: Domain-Adaptive Mamba for Efficient Urban Spatio-Temporal Prediction |
Rui An et.al. |
2506.18939 |
null |
2025-06-23 |
MARL-MambaContour: Unleashing Multi-Agent Deep Reinforcement Learning for Active Contour Optimization in Medical Image Segmentation |
Ruicheng Zhang et.al. |
2506.18679 |
null |
2025-06-23 |
BSMamba: Brightness and Semantic Modeling for Long-Range Interaction in Low-Light Image Enhancement |
Tongshun Zhang et.al. |
2506.18346 |
null |
2025-06-23 |
Jet Reconstruction with Mamba Networks in Collider Events |
Jinmian Li et.al. |
2506.18336 |
null |
2025-06-22 |
Memba: Membrane-driven Parameter-Efficient Fine-Tuning for Mamba |
Donghyun Lee et.al. |
2506.18184 |
null |
2025-06-22 |
Routing Mamba: Scaling State Space Models with Mixture-of-Experts Projection |
Zheng Zhan et.al. |
2506.18145 |
null |
2025-06-22 |
TEM^3-Learning: Time-Efficient Multimodal Multi-Task Learning for Advanced Assistive Driving |
Wenzhuo Liu et.al. |
2506.18084 |
null |
2025-06-22 |
OSDMamba: Enhancing Oil Spill Detection from Remote Sensing Images Using Selective State Space Model |
Shuaiyu Chen et.al. |
2506.18006 |
null |
2025-06-20 |
VMRA-MaR: An Asymmetry-Aware Temporal Framework for Longitudinal Breast Cancer Risk Prediction |
Zijun Sun et.al. |
2506.17412 |
null |
2025-06-20 |
State-Space Models in Efficient Whispered and Multi-dialect Speech Recognition |
Aref Farhadipour et.al. |
2506.16969 |
null |
2025-06-19 |
MambaHash: Visual State Space Deep Hashing Model for Large-Scale Image Retrieval |
Chao He et.al. |
2506.16353 |
link |
2025-06-19 |
EDNet: A Distortion-Agnostic Speech Enhancement Framework with Gating Mamba Mechanism and Phase Shift-Invariant Training |
Doyeop Kwak et.al. |
2506.16231 |
null |
2025-06-19 |
FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation |
Sen Wang et.al. |
2506.16201 |
null |
2025-06-19 |
LBMamba: Locally Bi-directional Mamba |
Jingwei Zhang et.al. |
2506.15976 |
null |
2025-06-18 |
Emergence of Primacy and Recency Effect in Mamba: A Mechanistic Point of View |
Muhammad Cendekia Airlangga et.al. |
2506.15156 |
null |
2025-06-17 |
FADPNet: Frequency-Aware Dual-Path Network for Face Super-Resolution |
Siyu Xu et.al. |
2506.14121 |
null |
2025-06-16 |
Scaling Algorithm Distillation for Continuous Control with Mamba |
Samuel Beaussant et.al. |
2506.13892 |
null |
2025-06-16 |
Stereo sound event localization and detection based on PSELDnet pretraining and BiMamba sequence modeling |
Wenmiao Gao et.al. |
2506.13455 |
null |
2025-06-16 |
MT-PCR: A Hybrid Mamba-Transformer with Spatial Serialization for Hierarchical Point Cloud Registration |
Bingxi Liu et.al. |
2506.13183 |
null |
2025-06-15 |
Unleashing Diffusion and State Space Models for Medical Image Segmentation |
Rong Wu et.al. |
2506.12747 |
null |
2025-06-14 |
An Exploration of Mamba for Speech Self-Supervised Models |
Tzu-Quan Lin et.al. |
2506.12606 |
null |
2025-06-14 |
Generalizable Trajectory Prediction via Inverse Reinforcement Learning with Mamba-Graph Architecture |
Wenyun Li et.al. |
2506.12474 |
null |
2025-06-14 |
MS-UMamba: An Improved Vision Mamba Unet for Fetal Abdominal Medical Image Segmentation |
Caixu Xu et.al. |
2506.12441 |
null |
2025-06-13 |
InceptionMamba: Efficient Multi-Stage Feature Enhancement with Selective State Space Model for Microscopic Medical Image Segmentation |
Daniya Najiha Abdul Kareem et.al. |
2506.12208 |
null |
2025-06-13 |
pLSTM: parallelizable Linear Source Transition Mark networks |
Korbinian Pöppel et.al. |
2506.11997 |
null |
2025-06-13 |
Understanding Input Selectivity in Mamba: Impact on Approximation Power, Memorization, and Associative Recall Capacity |
Ningyuan Huang et.al. |
2506.11891 |
null |
2025-06-13 |
MambaVSR: Content-Aware Scanning State Space Model for Video Super-Resolution |
Linfeng He et.al. |
2506.11768 |
null |
2025-06-13 |
Dissecting the Segmentation Model of End-to-End Diarization with Vector Clustering |
Alexis Plaquet et.al. |
2506.11605 |
null |
2025-06-11 |
Towards a general-purpose foundation model for fMRI analysis |
Cheng Wang et.al. |
2506.11167 |
null |
2025-06-12 |
Sequential-Parallel Duality in Prefix Scannable Models |
Morris Yau et.al. |
2506.10918 |
null |
2025-06-12 |
M4V: Multi-Modal Mamba for Text-to-Video Generation |
Jiancheng Huang et.al. |
2506.10915 |
null |
2025-06-12 |
DART: Differentiable Dynamic Adaptive Region Tokenizer for Vision Transformer and Mamba |
Shicheng Yin et.al. |
2506.10390 |
link |
2025-06-11 |
SparseSSM: Efficient Selective Structured State Space Models Can Be Pruned in One-Shot |
Kaiwen Tuo et.al. |
2506.09613 |
null |
2025-06-10 |
KARMA: A Multilevel Decomposition Hybrid Mamba Framework for Multivariate Long-Term Time Series Forecasting |
Hang Ye et.al. |
2506.08939 |
link |
2025-06-12 |
InceptionMamba: An Efficient Hybrid Network with Large Band Convolution and Bottleneck Mamba |
Yuhang Wang et.al. |
2506.08735 |
link |
2025-06-10 |
ECMNet: Lightweight Semantic Segmentation with Efficient CNN-Mamba Network |
Feixiang Du et.al. |
2506.08629 |
null |
2025-06-10 |
MLVTG: Mamba-Based Feature Alignment and LLM-Driven Purification for Multi-Modal Video Temporal Grounding |
Zhiyi Zhu et.al. |
2506.08512 |
null |
2025-06-10 |
SEMA: a Scalable and Efficient Mamba like Attention via Token Localization and Averaging |
Nhat Thanh Tran et.al. |
2506.08297 |
null |
2025-06-09 |
M2Restore: Mixture-of-Experts-based Mamba-CNN Fusion Framework for All-in-One Image Restoration |
Yongzhen Wang et.al. |
2506.07814 |
null |
2025-06-09 |
FMaMIL: Frequency-Driven Mamba Multi-Instance Learning for Weakly Supervised Lesion Segmentation in Medical Images |
Hangbei Cheng et.al. |
2506.07652 |
null |
2025-06-14 |
FAMSeg: Fetal Femur and Cranial Ultrasound Segmentation Using Feature-Aware Attention and Mamba Enhancement |
Jie He et.al. |
2506.07431 |
null |
2025-06-07 |
Polar Hierarchical Mamba: Towards Streaming LiDAR Object Detection with Point Clouds as Egocentric Sequences |
Mellon M. Zhang et.al. |
2506.06944 |
null |
2025-06-07 |
Hybrid Vision Transformer-Mamba Framework for Autism Diagnosis via Eye-Tracking Analysis |
Wafaa Kasri et.al. |
2506.06886 |
null |
2025-06-07 |
Flood-DamageSense: Multimodal Mamba with Multitask Learning for Building Flood Damage Assessment using SAR Remote Sensing Imagery |
Yu-Hsuan Ho et.al. |
2506.06667 |
link |
2025-06-05 |
DM-SegNet: Dual-Mamba Architecture for 3D Medical Image Segmentation with Global Context Modeling |
Hangyu Ji et.al. |
2506.05297 |
null |
2025-06-05 |
MesaNet: Sequence Modeling by Locally Optimal Test-Time Training |
Johannes von Oswald et.al. |
2506.05233 |
null |
2025-06-09 |
A Diffusion-Driven Temporal Super-Resolution and Spatial Consistency Enhancement Framework for 4D MRI imaging |
Xuanru Zhou et.al. |
2506.04116 |
link |
2025-06-05 |
MambaNeXt-YOLO: A Hybrid State Space Model for Real-time Object Detection |
Xiaochun Lei et.al. |
2506.03654 |
null |
2025-06-04 |
MamFusion: Multi-Mamba with Temporal Fusion for Partially Relevant Video Retrieval |
Xinru Ying et.al. |
2506.03473 |
null |
2025-06-03 |
SportMamba: Adaptive Non-Linear Multi-Object Tracking with State Space Models for Team Sports |
Dheeraj Khanna et.al. |
2506.03335 |
null |
2025-06-03 |
ConMamba: Contrastive Vision Mamba for Plant Disease Detection |
Abdullah Al Mamun et.al. |
2506.03213 |
null |
2025-06-03 |
InterMamba: Efficient Human-Human Interaction Generation with Adaptive Spatio-Temporal Mamba |
Zizhao Wu et.al. |
2506.03084 |
null |
2025-06-07 |
Transferable Sequential Recommendation with Vanilla Cross-Entropy Loss |
Hao Fan et.al. |
2506.02916 |
null |
2025-06-03 |
ControlMambaIR: Conditional Controls with State-Space Model for Image Restoration |
Cheng Yang et.al. |
2506.02633 |
null |
2025-06-08 |
Comba: Improving Bilinear RNNs with Closed-loop Control |
Jiaxi Hu et.al. |
2506.02475 |
null |
2025-06-02 |
Are Mamba-based Audio Foundation Models the Best Fit for Non-Verbal Emotion Recognition? |
Mohd Mujtaba Akhtar et.al. |
2506.02258 |
null |
2025-06-01 |
Mamba Drafters for Speculative Decoding |
Daewon Choi et.al. |
2506.01206 |
null |
2025-06-01 |
PARROT: Synergizing Mamba and Attention-based SSL Pre-Trained Models via Parallel Branch Hadamard Optimal Transport for Speech Emotion Recognition |
Orchid Chetia Phukan et.al. |
2506.01138 |
null |
2025-06-01 |
ECP-Mamba: An Efficient Multi-scale Self-supervised Contrastive Learning Method with State Space Model for PolSAR Image Classification |
Zuzheng Kuang et.al. |
2506.01040 |
null |
2025-06-01 |
Self-supervised ControlNet with Spatio-Temporal Mamba for Real-world Video Super-resolution |
Shijun Shi et.al. |
2506.01037 |
null |
2025-06-01 |
General-purpose audio representation learning for real-world sound scenes |
Goksenin Yuksel et.al. |
2506.00934 |
null |
2025-06-01 |
3D Skeleton-Based Action Recognition: A Review |
Mengyuan Liu et.al. |
2506.00915 |
null |
2025-05-30 |
ACM-UNet: Adaptive Integration of CNNs and Mamba for Efficient Medical Image Segmentation |
Jing Huang et.al. |
2505.24481 |
null |
2025-05-30 |
Mamba Knockout for Unraveling Factual Information Flow |
Nir Endy et.al. |
2505.24244 |
link |
2025-05-29 |
Mamba Integrated with Physics Principles Masters Long-term Chaotic System Forecasting |
Chang Liu et.al. |
2505.23863 |
null |
2025-05-29 |
SAMamba: Adaptive State Space Modeling with Hierarchical Vision for Infrared Small Target Detection |
Wenhao Xu et.al. |
2505.23214 |
link |
2025-05-29 |
Loss-Guided Model Sharing and Local Learning Correction in Decentralized Federated Learning for Crop Disease Classification |
Denis Mamba Kabala et.al. |
2505.23063 |
null |
2025-05-29 |
RiverMamba: A State Space Model for Global River Discharge and Flood Forecasting |
Mohamad Hakam Shams Eddin et.al. |
2505.22535 |
null |
2025-05-28 |
StateSpaceDiffuser: Bringing Long Context to Diffusion World Models |
Nedko Savov et.al. |
2505.22246 |
null |
2025-05-27 |
Revisiting Bi-Linear State Transitions in Recurrent Neural Networks |
M. Reza Ebrahimi et.al. |
2505.21749 |
null |
2025-05-29 |
Scaling Up Liquid-Resistance Liquid-Capacitance Networks for Efficient Sequence Modeling |
Mónika Farsang et.al. |
2505.21717 |
null |
2025-06-10 |
ZigzagPointMamba: Spatial-Semantic Mamba for Point Cloud Understanding |
Linshuang Diao et.al. |
2505.21381 |
null |
2025-05-27 |
Universal Speech Enhancement with Regression and Generative Mamba |
Rong Chao et.al. |
2505.21198 |
null |
2025-05-27 |
PMA: Towards Parameter-Efficient Point Cloud Understanding via Point Mamba Adapter |
Yaohua Zha et.al. |
2505.20941 |
null |
2025-05-28 |
HTMNet: A Hybrid Network with Transformer-Mamba Bottleneck Multimodal Fusion for Transparent and Reflective Objects Depth Completion |
Guanghu Xie et.al. |
2505.20904 |
null |
2025-05-27 |
TimePro: Efficient Multivariate Long-term Time Series Forecasting with Variable- and Time-Aware Hyper-state |
Xiaowen Ma et.al. |
2505.20774 |
link |
2025-05-27 |
Sparsified State-Space Models are Efficient Highway Networks |
Woomin Song et.al. |
2505.20698 |
link |
2025-05-27 |
OccLE: Label-Efficient 3D Semantic Occupancy Prediction |
Naiyu Fang et.al. |
2505.20617 |
null |
2025-05-27 |
Mamba-Driven Topology Fusion for Monocular 3-D Human Pose Estimation |
Zenghao Zheng et.al. |
2505.20611 |
null |
2025-05-28 |
Latent Mamba Operator for Partial Differential Equations |
Karn Tiwari et.al. |
2505.19105 |
null |
2025-05-28 |
FastMamba: A High-Speed and Efficient Mamba Accelerator on FPGA with Accurate Quantization |
Aotao Wang et.al. |
2505.18975 |
null |
2025-05-27 |
Hierarchical Mamba Meets Hyperbolic Geometry: A New Paradigm for Structured Language Embeddings |
Sarang Patil et.al. |
2505.18973 |
link |
2025-05-24 |
TK-Mamba: Marrying KAN with Mamba for Text-Driven 3D Medical Image Segmentation |
Haoyu Yang et.al. |
2505.18525 |
link |
2025-05-23 |
Weakly-supervised Mamba-Based Mastoidectomy Shape Prediction for Cochlear Implant Surgery Using 3D T-Distribution Loss |
Yike Zhang et.al. |
2505.18368 |
null |
2025-05-23 |
Selection Mechanisms for Sequence Modeling using Linear State Space Models |
Umberto Casti et.al. |
2505.17932 |
null |
2025-05-23 |
Hybrid Mamba-Transformer Decoder for Error-Correcting Codes |
Shy-el Cohen et.al. |
2505.17834 |
null |
2025-05-23 |
Structured Linear CDEs: Maximally Expressive and Parallel-in-Time Sequence Models |
Benjamin Walker et.al. |
2505.17761 |
link |
2025-05-23 |
Causal Spatio-Temporal Prediction: An Effective and Efficient Multi-Modal Approach |
Yuting Huang et.al. |
2505.17637 |
null |
2025-05-31 |
MEGADance: Mixture-of-Experts Architecture for Genre-Aware 3D Dance Generation |
Kaixing Yang et.al. |
2505.17543 |
null |
2025-05-23 |
Graph Mamba for Efficient Whole Slide Image Understanding |
Jiaxuan Lu et.al. |
2505.17457 |
null |
2025-05-27 |
EVM-Fusion: An Explainable Vision Mamba Architecture with Neural Algorithmic Fusion |
Zichuan Yang et.al. |
2505.17367 |
null |
2025-06-02 |
JanusDNA: A Powerful Bi-directional Hybrid DNA Foundation Model |
Qihao Duan et.al. |
2505.17257 |
null |
2025-05-23 |
Active Speech Enhancement: Active Speech Denoising Decliping and Deveraberation |
Ofir Yaish et.al. |
2505.16911 |
null |
2025-05-22 |
PCMamba: Physics-Informed Cross-Modal State Space Model for Dual-Camera Compressive Hyperspectral Imaging |
Ge Meng et.al. |
2505.16373 |
null |
2025-05-22 |
SAMba-UNet: Synergizing SAM2 and Mamba in UNet with Heterogeneous Aggregation for Cardiac MRI Segmentation |
Guohao Huo et.al. |
2505.16304 |
null |
2025-05-21 |
FR-Mamba: Time-Series Physical Field Reconstruction Based on State Space Model |
Jiahuan Long et.al. |
2505.16083 |
null |
2025-05-21 |
HAMF: A Hybrid Attention-Mamba Framework for Joint Scene Context Understanding and Future Motion Representation Learning |
Xiaodong Mei et.al. |
2505.15703 |
null |
2025-05-22 |
Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought |
Tencent Hunyuan Team et.al. |
2505.15431 |
null |
2025-05-21 |
SAMA-UNet: Enhancing Medical Image Segmentation with Self-Adaptive Mamba-Like Attention and Causal-Resonance Learning |
Saqib Qamar et.al. |
2505.15234 |
link |
2025-05-21 |
Mechanistic evaluation of Transformers and state space models |
Aryaman Arora et.al. |
2505.15105 |
null |
2025-05-21 |
RLBenchNet: The Right Network for the Right Reinforcement Learning Task |
Ivan Smirnov et.al. |
2505.15040 |
link |
2025-05-20 |
RefiDiff: Refinement-Aware Diffusion for Efficient Missing Data Imputation |
Md Atik Ahamed et.al. |
2505.14451 |
null |
2025-05-20 |
TF-Mamba: Text-enhanced Fusion Mamba with Missing Modalities for Robust Multimodal Sentiment Analysis |
Xiang Li et.al. |
2505.14329 |
link |
2025-05-21 |
MatchDance: Collaborative Mamba-Transformer Architecture Matching for High-Quality 3D Dance Synthesis |
Kaixing Yang et.al. |
2505.14222 |
null |
2025-05-20 |
Scaling Vision Mamba Across Resolutions via Fractal Traversal |
Bo Li et.al. |
2505.14062 |
null |
2025-05-23 |
Selective Structured State Space for Multispectral-fused Small Target Detection |
Qianqian Zhang et.al. |
2505.14043 |
null |
2025-05-20 |
BiCrossMamba-ST: Speech Deepfake Detection with Bidirectional Mamba Spectro-Temporal Cross-Attention |
Yassine El Kheir et.al. |
2505.13930 |
null |
2025-05-19 |
SourceDetMamba: A Graph-aware State Space Model for Source Detection in Sequential Hypergraphs |
Le Cheng et.al. |
2505.12910 |
null |
2025-05-19 |
Mamba-Adaptor: State Space Model Adaptor for Visual Recognition |
Fei Xie et.al. |
2505.12685 |
null |
2025-05-19 |
Safe-Sora: Safe Text-to-Video Generation via Graphical Watermarking |
Zihan Su et.al. |
2505.12667 |
null |
2025-05-18 |
Alternators With Noise Models |
Mohammad R. Rezaei et.al. |
2505.12544 |
null |
2025-05-18 |
MTIL: Encoding Full History with Mamba for Temporal Imitation Learning |
Yulin Zhou et.al. |
2505.12410 |
link |
2025-05-17 |
WaLRUS: Wavelets for Long-range Representation Using SSMs |
Hossein Babaei et.al. |
2505.12161 |
null |
2025-05-17 |
GeoMaNO: Geometric Mamba Neural Operator for Partial Differential Equations |
Xi Han et.al. |
2505.12020 |
null |
2025-05-22 |
Multi-modal Collaborative Optimization and Expansion Network for Event-assisted Single-eye Expression Recognition |
Runduo Han et.al. |
2505.12007 |
link |
2025-05-17 |
MedVKAN: Efficient Feature Extraction with Mamba and KAN for Medical Image Segmentation |
Hancan Zhu et.al. |
2505.11797 |
link |
2025-05-21 |
Spatiotemporal Field Generation Based on Hybrid Mamba-Transformer with Physics-informed Fine-tuning |
Peimian Du et.al. |
2505.11578 |
null |
2025-05-16 |
Equal is Not Always Fair: A New Perspective on Hyperspectral Representation Non-Uniformity |
Wuzhou Quan et.al. |
2505.11267 |
null |
2025-05-16 |
Hybrid-Emba3D: Geometry-Aware and Cross-Path Feature Hybrid Enhanced State Space Model for Point Cloud Classification |
Bin Liu et.al. |
2505.11099 |
link |
2025-05-16 |
HSRMamba: Efficient Wavelet Stripe State Space Model for Hyperspectral Image Super-Resolution |
Baisong Li et.al. |
2505.11062 |
link |
2025-05-15 |
SRMamba: Mamba for Super-Resolution of LiDAR Point Clouds |
Chuang Chen et.al. |
2505.10601 |
null |
2025-05-14 |
Efficient Malicious UAV Detection Using Autoencoder-TSMamba Integration |
Azim Akhtarshenas et.al. |
2505.10585 |
null |
2025-05-20 |
HWA-UNETR: Hierarchical Window Aggregate UNETR for 3D Multimodal Gastric Lesion Segmentation |
Jiaming Liang et.al. |
2505.10464 |
link |
2025-05-15 |
MambaControl: Anatomy Graph-Enhanced Mamba ControlNet with Fourier Refinement for Diffusion-Based Disease Trajectory Prediction |
Hao Yang et.al. |
2505.09965 |
null |
2025-05-14 |
Dyadic Mamba: Long-term Dyadic Human Motion Synthesis |
Julian Tanke et.al. |
2505.09827 |
null |
2025-05-14 |
Spec2VolCAMU-Net: A Spectrogram-to-Volume Model for EEG-to-fMRI Reconstruction based on Multi-directional Time-Frequency Convolutional Attention Encoder and Vision-Mamba U-Net |
Dongyi He et.al. |
2505.09521 |
link |
2025-05-14 |
MrTrack: Register Mamba for Needle Tracking with Rapid Reciprocating Motion during Ultrasound-Guided Aspiration Biopsy |
Yuelin Zhang et.al. |
2505.09450 |
null |
2025-05-14 |
Efficient LiDAR Reflectance Compression via Scanning Serialization |
Jiahao Zhu et.al. |
2505.09433 |
null |
2025-05-14 |
HMamba: Hyperbolic Mamba for Sequential Recommendation |
Qianru Zhang et.al. |
2505.09205 |
null |
2025-05-13 |
Block-Biased Mamba for Long-Range Sequence Processing |
Annan Yu et.al. |
2505.09022 |
null |
2025-05-13 |
SPAT: Sensitivity-based Multihead-attention Pruning on Time Series Forecasting Models |
Suhan Guo et.al. |
2505.08768 |
null |
2025-05-13 |
A Mamba-based Network for Semi-supervised Singing Melody Extraction Using Confidence Binary Regularization |
Xiaoliang He et.al. |
2505.08681 |
link |
2025-05-13 |
ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking |
Haofeng Liu et.al. |
2505.08581 |
link |
2025-05-13 |
Efficient Unstructured Pruning of Mamba State-Space Models for Resource-Constrained Environments |
Ibne Farabi Shihab et.al. |
2505.08299 |
null |
2025-05-12 |
Overflow Prevention Enhances Long-Context Recurrent LLMs |
Assaf Ben-Kish et.al. |
2505.07793 |
link |
2025-05-12 |
SmartUT: Receive Beamforming for Spectral Coexistence of NGSO Satellite Systems |
Almoatssimbillah Saifaldawla et.al. |
2505.07714 |
null |
2025-05-12 |
ABS-Mamba: SAM2-Driven Bidirectional Spiral Mamba Network for Medical Image Translation |
Feng Yuan et.al. |
2505.07687 |
null |
2025-05-10 |
Probing In-Context Learning: Impact of Task Complexity and Model Architecture on Generalization and Efficiency |
Binwen Liu et.al. |
2505.06475 |
link |
2025-05-09 |
Topo-VM-UNetV2: Encoding Topology into Vision Mamba UNet for Polyp Segmentation |
Diego Adame et.al. |
2505.06210 |
null |
2025-05-02 |
MDDFNet: Mamba-based Dynamic Dual Fusion Network for Traffic Sign Detection |
TianYi Yu et.al. |
2505.05491 |
null |
2025-05-08 |
PillarMamba: Learning Local-Global Context for Roadside Point Cloud via Hybrid State Space Model |
Zhang Zhang et.al. |
2505.05397 |
null |
2025-05-08 |
EDmamba: A Simple yet Effective Event Denoising Method with State Space Model |
Ciyu Ruan et.al. |
2505.05391 |
null |
2025-05-08 |
PRE-Mamba: A 4D State Space Model for Ultra-High-Frequent Event Camera Deraining |
Ciyu Ruan et.al. |
2505.05307 |
null |
2025-05-07 |
Cross-organ all-in-one parallel compressed sensing magnetic resonance imaging |
Baoshun Shi et.al. |
2505.04658 |
link |
2025-05-07 |
M2Rec: Multi-scale Mamba for Efficient Sequential Recommendation |
Qianru Zhang et.al. |
2505.04445 |
null |
2025-05-07 |
WDMamba: When Wavelet Degradation Prior Meets Vision Mamba for Image Dehazing |
Jie Sun et.al. |
2505.04369 |
link |
2025-05-11 |
SMMT: Siamese Motion Mamba with Self-attention for Thermal Infrared Target Tracking |
Shang Zhang et.al. |
2505.04088 |
null |
2025-05-02 |
Design description of Wisdom Computing Persperctive |
TianYi Yu et.al. |
2505.03800 |
null |
2025-05-06 |
A Comprehensive Survey of Large AI Models for Future Communications: Foundations, Applications and Challenges |
Feibo Jiang et.al. |
2505.03556 |
link |
2025-05-06 |
Recall with Reasoning: Chain-of-Thought Distillation for Mamba’s Long-Context Memory and Extrapolation |
Junyu Ma et.al. |
2505.03320 |
null |
2025-05-06 |
Mamba-Diffusion Model with Learnable Wavelet for Controllable Symbolic Music Generation |
Jincheng Zhang et.al. |
2505.03314 |
link |
2025-05-06 |
DiffVQA: Video Quality Assessment Using Diffusion Feature Extractor |
Wei-Ting Chen et.al. |
2505.03261 |
null |
2025-05-05 |
Text to Image Generation and Editing: A Survey |
Pengfei Yang et.al. |
2505.02527 |
null |
2025-05-04 |
Meta-Black-Box-Optimization through Offline Q-function Learning |
Zeyuan Ma et.al. |
2505.02010 |
link |
2025-05-02 |
RD-UIE: Relation-Driven State Space Modeling for Underwater Image Enhancement |
Kui Jiang et.al. |
2505.01224 |
link |
2025-05-02 |
LMDepth: Lightweight Mamba-based Monocular Depth Estimation for Real-World Deployment |
Jiahuan Long et.al. |
2505.00980 |
null |
2025-05-03 |
Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook |
Muyi Bao et.al. |
2505.00630 |
link |
2025-04-30 |
Mamba Based Feature Extraction And Adaptive Multilevel Feature Fusion For 3D Tumor Segmentation From Multi-modal Medical Image |
Zexin Ji et.al. |
2504.21281 |
null |
2025-04-29 |
DYNAMAX: Dynamic computing for Transformers and Mamba based architectures |
Miguel Nogales et.al. |
2504.20922 |
null |
2025-04-29 |
ISDrama: Immersive Spatial Drama Generation through Multimodal Prompting |
Yu Zhang et.al. |
2504.20630 |
null |
2025-04-29 |
MambaMoE: Mixture-of-Spectral-Spatial-Experts State Space Model for Hyperspectral Image Classification |
Yichu Xu et.al. |
2504.20509 |
null |
2025-04-28 |
GPA-RAM: Grasp-Pretraining Augmented Robotic Attention Mamba for Spatial Task Learning |
Juyi Sheng et.al. |
2504.19683 |
null |
2025-04-25 |
RSFR: A Coarse-to-Fine Reconstruction Framework for Diffusion Tensor Cardiac MRI with Semantic-Aware Refinement |
Jiahao Huang et.al. |
2504.18520 |
null |
2025-04-24 |
Mamba-Sea: A Mamba-based Framework with Global-to-Local Sequence Augmentation for Generalizable Medical Image Segmentation |
Zihan Cheng et.al. |
2504.17515 |
link |
2025-04-24 |
StereoMamba: Real-time and Robust Intraoperative Stereo Disparity Estimation via Long-range Spatial Dependencies |
Xu Wang et.al. |
2504.17401 |
null |
2025-04-22 |
Bidirectional Mamba for Single-Cell Data: Efficient Context Learning with Biological Fidelity |
Cong Qi et.al. |
2504.16956 |
null |
2025-04-23 |
Random Long-Context Access for Mamba via Hardware-aligned Hierarchical Sparse Attention |
Xiang Hu et.al. |
2504.16795 |
null |
2025-04-23 |
A Diff-Attention Aware State Space Fusion Model for Remote Sensing Classification |
Wenping Ma et.al. |
2504.16665 |
link |
2025-04-22 |
LongMamba: Enhancing Mamba’s Long Context Capabilities via Training-Free Receptive Field Enlargement |
Zhifan Ye et.al. |
2504.16053 |
link |
2025-04-22 |
MVQA: Mamba with Unified Sampling for Efficient Video Quality Assessment |
Yachun Mi et.al. |
2504.16003 |
null |
2025-05-05 |
Observability conditions for neural state-space models with eigenvalues and their roots of unity |
Andrew Gracyk et.al. |
2504.15758 |
null |
2025-04-22 |
HS-Mamba: Full-Field Interaction Multi-Groups Mamba for Hyperspectral Image Classification |
Hongxing Peng et.al. |
2504.15612 |
null |
2025-04-27 |
Instance-Adaptive Keypoint Learning with Local-to-Global Geometric Aggregation for Category-Level Object Pose Estimation |
Xiao Zhang et.al. |
2504.15134 |
null |
2025-04-20 |
VM-BHINet: Vision Mamba Bimanual Hand Interaction Network for 3D Interacting Hand Mesh Recovery From a Single RGB Image |
Han Bi et.al. |
2504.14618 |
null |
2025-04-19 |
Efficient Spiking Point Mamba for Point Cloud Analysis |
Peixi Wu et.al. |
2504.14371 |
null |
2025-04-18 |
WeatherGen: A Unified Diverse Weather Generator for LiDAR Point Clouds via Spider Mamba Diffusion |
Yang Wu et.al. |
2504.13561 |
link |
2025-04-18 |
A Novel Hybrid Approach for Retinal Vessel Segmentation with Dynamic Long-Range Dependency and Multi-Scale Retinal Edge Fusion Enhancement |
Yihao Ouyang et.al. |
2504.13553 |
link |
2025-04-26 |
U-Shape Mamba: State Space Model for faster diffusion |
Alex Ergasti et.al. |
2504.13499 |
link |
2025-04-16 |
RadMamba: Efficient Human Activity Recognition through Radar-based Micro-Doppler-Oriented Mamba State-Space Model |
Yizhuo Wu et.al. |
2504.12039 |
link |
2025-04-16 |
ACMamba: Fast Unsupervised Anomaly Detection via An Asymmetrical Consensus State Space Model |
Guanchun Wang et.al. |
2504.11781 |
null |
2025-04-15 |
Mamba-Based Ensemble learning for White Blood Cell Classification |
Lewis Clifton et.al. |
2504.11438 |
link |
2025-04-15 |
Change State Space Models for Remote Sensing Change Detection |
Elman Ghazaei et.al. |
2504.11080 |
link |
2025-04-20 |
An Efficient and Mixed Heterogeneous Model for Image Restoration |
Yubin Gu et.al. |
2504.10967 |
link |
2025-04-14 |
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models |
Junxiong Wang et.al. |
2504.10449 |
link |
2025-04-14 |
Global and Local Mamba Network for Multi-Modality Medical Image Super-Resolution |
Zexin Ji et.al. |
2504.10105 |
null |
2025-04-24 |
OmniMamba4D: Spatio-temporal Mamba for longitudinal CT lesion segmentation |
Justin Namuk Kim et.al. |
2504.09655 |
null |
2025-04-15 |
Sparse Deformable Mamba for Hyperspectral Image Classification |
Lincoln Linlin Xu et.al. |
2504.09446 |
null |
2025-04-12 |
Repetitive Contrastive Learning Enhances Mamba’s Selectivity in Time Series Prediction |
Wenbo Yan et.al. |
2504.09185 |
null |
2025-04-11 |
EMO-X: Efficient Multi-Person Pose and Shape Estimation in One-Stage |
Haohang Jian et.al. |
2504.08718 |
null |
2025-04-10 |
ms-Mamba: Multi-scale Mamba for Time-Series Forecasting |
Yusuf Meric Karadag et.al. |
2504.07654 |
null |
2025-04-10 |
A Novel Mamba-based Sequential Recommendation Method |
Jun Yuan et.al. |
2504.07398 |
null |
2025-04-10 |
Novel Diffusion Models for Multimodal 3D Hand Trajectory Prediction |
Junyi Ma et.al. |
2504.07375 |
link |
2025-04-09 |
HGMamba: Enhancing 3D Human Pose Estimation with a HyperGCN-Mamba Network |
Hu Cui et.al. |
2504.06638 |
null |
2025-04-08 |
DefMamba: Deformable Visual State Space Model |
Leiye Liu et.al. |
2504.05794 |
null |
2025-04-07 |
One-Minute Video Generation with Test-Time Training |
Karan Dalal et.al. |
2504.05298 |
null |
2025-04-07 |
GAMDTP: Dynamic Trajectory Prediction with Graph Attention Mamba Network |
Yunxiang Liu et.al. |
2504.04862 |
null |
2025-04-07 |
Dynamic Vision Mamba |
Mengxuan Wu et.al. |
2504.04787 |
link |
2025-04-06 |
GAMBAS: Generalised-Hilbert Mamba for Super-resolution of Paediatric Ultra-Low-Field MRI |
Levente Baljer et.al. |
2504.04523 |
link |
2025-04-06 |
Gating is Weighting: Understanding Gated Linear Attention through In-context Learning |
Yingcong Li et.al. |
2504.04308 |
null |
2025-04-15 |
Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models |
NVIDIA et.al. |
2504.03624 |
null |
2025-04-04 |
Pyramid-based Mamba Multi-class Unsupervised Anomaly Detection |
Nasar Iqbal et.al. |
2504.03442 |
null |
2025-04-15 |
Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation |
Xin Zhang et.al. |
2504.03193 |
link |
2025-04-07 |
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation |
Fa-Ting Hong et.al. |
2504.02542 |
link |
2025-04-24 |
Cognitive Memory in Large Language Models |
Lianlei Shan et.al. |
2504.02441 |
null |
2025-04-03 |
EvMic: Event-based Non-contact sound recovery from effective spatial-temporal modeling |
Hao Yin et.al. |
2504.02402 |
null |
2025-04-02 |
Attention Mamba: Time Series Modeling with Adaptive Pooling Acceleration and Receptive Field Enhancements |
Sijie Xiong et.al. |
2504.02013 |
null |
2025-04-09 |
Mesh Mamba: A Unified State Space Model for Saliency Prediction in Non-Textured and Textured Meshes |
Kaiwei Zhang et.al. |
2504.01466 |
link |
2025-04-02 |
CFMD: Dynamic Cross-layer Feature Fusion for Salient Object Detection |
Jin Lian et.al. |
2504.01326 |
null |
2025-03-30 |
ViT-Linearizer: Distilling Quadratic Knowledge into Linear-Time Vision Models |
Guoyizhe Wei et.al. |
2504.00037 |
null |
2025-03-31 |
TransMamba: Flexibly Switching between Transformer and Mamba |
Yixing Li et.al. |
2503.24067 |
null |
2025-03-31 |
AMMSM: Adaptive Motion Magnification and Sparse Mamba for Micro-Expression Recognition |
Xuxiong Liu et.al. |
2503.24057 |
null |
2025-03-31 |
Exploring Temporal Dynamics in Event-based Eye Tracker |
Hongwei Ren et.al. |
2503.23725 |
link |
2025-03-29 |
Efficient Explicit Joint-level Interaction Modeling with Mamba for Text-guided HOI Generation |
Guohong Huang et.al. |
2503.23121 |
link |
2025-03-29 |
SSM-RDU: A Reconfigurable Dataflow Unit for Long-Sequence State-Space Models |
Sho Ko et.al. |
2503.22937 |
null |
2025-03-26 |
Adaptive State-Space Mamba for Real-Time Sensor Data Anomaly Detection |
Alice Zhang et.al. |
2503.22743 |
null |
2025-03-26 |
Ancestral Mamba: Enhancing Selective Discriminant Space Model with Online Visual Prototype Learning for Efficient and Robust Discriminant Approach |
Jiahao Qin et.al. |
2503.22729 |
null |
2025-04-02 |
Q-MambaIR: Accurate Quantized Mamba for Efficient Image Restoration |
Yujie Chen et.al. |
2503.21970 |
null |
2025-03-27 |
vGamba: Attentive State Space Bottleneck for efficient Long-range Dependencies in Visual Recognition |
Yunusa Haruna et.al. |
2503.21262 |
link |
2025-03-27 |
VADMamba: Exploring State Space Models for Fast Video Anomaly Detection |
Jiahao Lyu et.al. |
2503.21169 |
link |
2025-03-26 |
Text-Driven Voice Conversion via Latent State-Space Modeling |
Wen Li et.al. |
2503.20999 |
null |
2025-03-26 |
Exploiting Temporal State Space Sharing for Video Semantic Segmentation |
Syed Ariff Syed Hesham et.al. |
2503.20824 |
null |
2025-03-26 |
Mamba-3D as Masked Autoencoders for Accurate and Data-Efficient Analysis of Medical Ultrasound Videos |
Jiaheng Zhou et.al. |
2503.20258 |
link |
2025-04-01 |
EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction |
Chengjie Ge et.al. |
2503.19721 |
null |
2025-03-25 |
Burst Image Super-Resolution with Mamba |
Ozan Unal et.al. |
2503.19634 |
null |
2025-03-25 |
Prompt-Guided Dual-Path UNet with Mamba for Medical Image Segmentation |
Shaolei Zhang et.al. |
2503.19589 |
null |
2025-03-25 |
Scene-agnostic Pose Regression for Visual Localization |
Junwei Zheng et.al. |
2503.19543 |
null |
2025-03-25 |
ASP-VMUNet: Atrous Shifted Parallel Vision Mamba U-Net for Skin Lesion Segmentation |
Muyi Bao et.al. |
2503.19427 |
link |
2025-03-25 |
A Comprehensive Analysis of Mamba for 3D Volumetric Medical Image Segmentation |
Chaohan Wang et.al. |
2503.19308 |
null |
2025-03-25 |
$L^2$FMamba: Lightweight Light Field Image Super-Resolution with State Space Model |
Zeqiang Wei et.al. |
2503.19253 |
null |
2025-03-22 |
A Survey on Structured State Space Sequence (S4) Models |
Shriyank Somvanshi et.al. |
2503.18970 |
null |
2025-03-24 |
Distil-xLSTM: Learning Attention Mechanisms through Recurrent Structures |
Abdoul Majid O. Thiombiano et.al. |
2503.18565 |
null |
2025-03-24 |
Exploring State Space Model in Wavelet Domain: An Infrared and Visible Image Fusion Network via Wavelet Transform and State Space Model |
Tianpei Zhang et.al. |
2503.18378 |
null |
2025-03-23 |
M3Net: Multimodal Multi-task Learning for 3D Detection, Segmentation, and Occupancy Prediction in Autonomous Driving |
Xuesong Chen et.al. |
2503.18100 |
link |
2025-03-23 |
Selecting and Pruning: A Differentiable Causal Sequentialized State-Space Model for Two-View Correspondence Learning |
Xiang Fang et.al. |
2503.17938 |
null |
2025-03-23 |
GLADMamba: Unsupervised Graph-Level Anomaly Detection Powered by Selective State Space Model |
Yali Fu et.al. |
2503.17903 |
link |
2025-03-22 |
MAMAT: 3D Mamba-Based Atmospheric Turbulence Removal and its Object Detection Capability |
Paul Hill et.al. |
2503.17700 |
null |
2025-03-21 |
MM-UNet: Meta Mamba UNet for Medical Image Segmentation |
Bin Xie et.al. |
2503.17540 |
null |
2025-03-21 |
Align Your Rhythm: Generating Highly Aligned Dance Poses with Gating-Enhanced Rhythm-Aware Feature Representation |
Congyi Fan et.al. |
2503.17340 |
null |
2025-03-21 |
Cross-Modal Interactive Perception Network with Mamba for Lung Tumor Segmentation in PET-CT Images |
Jie Mei et.al. |
2503.17261 |
link |
2025-03-21 |
Salient Object Detection in Traffic Scene through the TSOD10K Dataset |
Yu Qiu et.al. |
2503.16910 |
null |
2025-03-21 |
Auto-Regressive Diffusion for Generating 3D Human-Object Interactions |
Zichen Geng et.al. |
2503.16801 |
link |
2025-03-20 |
Binarized Mamba-Transformer for Lightweight Quad Bayer HybridEVS Demosaicing |
Shiyang Zhou et.al. |
2503.16134 |
link |
2025-03-20 |
SaMam: Style-aware State Space Model for Arbitrary Image Style Transfer |
Hongda Liu et.al. |
2503.15934 |
link |
2025-03-18 |
Core-Periphery Principle Guided State Space Model for Functional Connectome Classification |
Minheng Chen et.al. |
2503.14655 |
null |
2025-03-18 |
Tiled Flash Linear Attention: More Efficient Linear RNN and xLSTM Kernels |
Maximilian Beck et.al. |
2503.14376 |
link |
2025-03-26 |
MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations |
Hongyu Ke et.al. |
2503.13858 |
link |
2025-03-18 |
MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling |
Yingyue Li et.al. |
2503.13440 |
link |
2025-03-17 |
xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference |
Maximilian Beck et.al. |
2503.13427 |
link |
2025-03-17 |
Scale Efficient Training for Large Datasets |
Qing Zhou et.al. |
2503.13385 |
link |
2025-03-17 |
DynSTG-Mamba: Dynamic Spatio-Temporal Graph Mamba with Cross-Graph Knowledge Distillation for Gait Disorders Recognition |
Zakariae Zrimek et.al. |
2503.13156 |
null |
2025-03-17 |
TFDM: Time-Variant Frequency-Based Point Cloud Diffusion with Mamba |
Jiaxu Liu et.al. |
2503.13004 |
null |
2025-03-17 |
Pose as a Modality: A Psychology-Inspired Network for Personality Recognition with a New Multimodal Dataset |
Bin Tang et.al. |
2503.12912 |
null |
2025-03-16 |
BS-Mamba for Black-Soil Area Detection On the Qinghai-Tibetan Plateau |
Xuan Ma et.al. |
2503.12495 |
null |
2025-03-16 |
VideoMAP: Toward Scalable Mamba-based Video Autoregressive Pretraining |
Yunze Liu et.al. |
2503.12332 |
null |
2025-03-15 |
Toward Foundation Models for Online Complex Event Detection in CPS-IoT: A Case Study |
Liying Han et.al. |
2503.12282 |
null |
2025-03-18 |
UniMamba: Unified Spatial-Channel Representation Learning with Group-Efficient Mamba for LiDAR-based 3D Object Detection |
Xin Jin et.al. |
2503.12009 |
null |
2025-03-25 |
BioMamba: Leveraging Spectro-Temporal Embedding in Bidirectional Mamba for Enhanced Biosignal Classification |
Jian Qian et.al. |
2503.11741 |
link |
2025-03-14 |
Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers |
Weiming Ren et.al. |
2503.11579 |
null |
2025-03-14 |
Hierarchical Information-Guided Spatio-Temporal Mamba for Stock Time Series Forecasting |
Wenbo Yan et.al. |
2503.11387 |
null |
2025-03-14 |
Breaking Shallow Limits: Task-Driven Pixel Fusion for Gap-free RGBT Tracking |
Andong Lu et.al. |
2503.11247 |
null |
2025-03-14 |
Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models |
Xingtai Lv et.al. |
2503.11224 |
null |
2025-03-14 |
Towards General Multimodal Visual Tracking |
Andong Lu et.al. |
2503.11218 |
null |
2025-03-14 |
FMNet: Frequency-Assisted Mamba-Like Linear Attention Network for Camouflaged Object Detection |
Ming Deng et.al. |
2503.11030 |
null |
2025-03-13 |
OuroMamba: A Data-Free Quantization Framework for Vision Mamba Models |
Akshat Ramachandran et.al. |
2503.10959 |
null |
2025-03-13 |
Trajectory Mamba: Efficient Attention-Mamba Forecasting Model Based on Selective SSM |
Yizhou Huang et.al. |
2503.10898 |
null |
2025-03-13 |
Mamba time series forecasting with uncertainty propagation |
Pedro Pessoa et.al. |
2503.10873 |
link |
2025-03-13 |
Fixed-Point RNNs: From Diagonal to Dense in a Few Iterations |
Sajad Movahedi et.al. |
2503.10799 |
null |
2025-03-09 |
Small Vision-Language Models: A Survey on Compact Architectures and Techniques |
Nitesh Patnaik et.al. |
2503.10665 |
null |
2025-03-14 |
Category Prompt Mamba Network for Nuclei Segmentation and Classification |
Ye Zhang et.al. |
2503.10422 |
null |
2025-03-13 |
RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing |
Fengxiang Wang et.al. |
2503.10392 |
link |
2025-03-13 |
Mamba-VA: A Mamba-based Approach for Continuous Emotion Recognition in Valence-Arousal Space |
Yuheng Liang et.al. |
2503.10104 |
link |
2025-03-13 |
Light-weighted foundation model for seismic data processing based on representative and non-redundant pre-training dataset |
Xintong Dong et.al. |
2503.10092 |
null |
2025-03-13 |
Multi-Modal Mamba Modeling for Survival Prediction (M4Survive): Adapting Joint Foundation Model Representations |
Ho Hin Lee et.al. |
2503.10057 |
link |
2025-03-12 |
ViM-VQ: Efficient Post-Training Vector Quantization for Visual Mamba |
Juncan Deng et.al. |
2503.09509 |
null |
2025-03-12 |
Diff-CL: A Novel Cross Pseudo-Supervision Method for Semi-supervised Medical Image Segmentation |
Xiuzhen Guo et.al. |
2503.09408 |
null |
2025-03-12 |
GIGP: A Global Information Interacting and Geometric Priors Focusing Framework for Semi-supervised Medical Image Segmentation |
Lianyuan Yu et.al. |
2503.09355 |
null |
2025-03-17 |
Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection |
Xuzhong Hu et.al. |
2503.08992 |
null |
2025-03-11 |
MinGRU-Based Encoder for Turbo Autoencoder Frameworks |
Rick Fritschek et.al. |
2503.08451 |
null |
2025-03-11 |
EnergyFormer: Energy Attention with Fourier Embedding for Hyperspectral Image Classification |
Saad Sohail et.al. |
2503.08239 |
null |
2025-03-10 |
MambaFlow: A Mamba-Centric Architecture for End-to-End Optical Flow Estimation |
Juntian Du et.al. |
2503.07046 |
null |
2025-03-10 |
HiSTF Mamba: Hierarchical Spatiotemporal Fusion with Multi-Granular Body-Spatial Modeling for High-Fidelity Text-to-Motion Generation |
Xingzu Zhan et.al. |
2503.06897 |
null |
2025-03-10 |
From Image- to Pixel-level: Label-efficient Hyperspectral Image Reconstruction |
Yihong Leng et.al. |
2503.06852 |
null |
2025-03-09 |
Global-Aware Monocular Semantic Scene Completion with State Space Models |
Shijie Li et.al. |
2503.06569 |
null |
2025-03-09 |
Future-Aware Interaction Network For Motion Forecasting |
Shijie Li et.al. |
2503.06565 |
null |
2025-03-09 |
M$^3$amba: CLIP-driven Mamba Model for Multi-modal Remote Sensing Classification |
Mingxiang Cao et.al. |
2503.06446 |
link |
2025-03-09 |
Swift Hydra: Self-Reinforcing Generative Framework for Anomaly Detection with Multiple Mamba Models |
Nguyen Do et.al. |
2503.06413 |
link |
2025-03-07 |
Growth-fragmentation model for a population presenting heterogeneity in growth rate: Malthus parameter and long-time behavior |
Anaïs Rat et.al. |
2503.05232 |
null |
2025-03-06 |
Spectral Informed Mamba for Robust Point Cloud Processing |
Ali Bahri et.al. |
2503.04953 |
null |
2025-03-06 |
Token-Efficient Long Video Understanding for Multimodal LLMs |
Jindong Jiang et.al. |
2503.04130 |
null |
2025-03-05 |
JamMa: Ultra-lightweight Local Feature Matching with Joint Mamba |
Xiaoyong Lu et.al. |
2503.03437 |
null |
2025-03-04 |
XFMamba: Cross-Fusion Mamba for Multi-View Medical Image Classification |
Xiaoyu Zheng et.al. |
2503.02619 |
null |
2025-03-14 |
COMMA: Coordinate-aware Modulated Mamba Network for 3D Dispersed Vessel Segmentation |
Gen Shi et.al. |
2503.02332 |
link |
2025-03-03 |
Mamba base PKD for efficient knowledge compression |
José Medina et.al. |
2503.01727 |
null |
2025-03-03 |
SparseMamba-PCL: Scribble-Supervised Medical Image Segmentation via SAM-Guided Progressive Collaborative Learning |
Luyi Qiu et.al. |
2503.01633 |
link |
2025-03-03 |
From Claims to Evidence: A Unified Framework and Critical Analysis of CNN vs. Transformer vs. Mamba in Medical Image Segmentation |
Pooya Mohammadi Kazaj et.al. |
2503.01306 |
link |
2025-03-03 |
DCAMamba: Mamba-based Rapid Response DC Arc Fault Detection |
Lukun Wang et.al. |
2503.01264 |
null |
2025-03-09 |
SCSegamba: Lightweight Structure-Aware Vision Mamba for Crack Segmentation in Structures |
Hui Liu et.al. |
2503.01113 |
link |
2025-03-02 |
LightEndoStereo: A Real-time Lightweight Stereo Matching Method for Endoscopy Images |
Yang Ding et.al. |
2503.00731 |
link |
2025-03-01 |
2DMCG: 2DMamba with Change Flow Guidance for Change Detection in Remote Sensing |
JunYao Kaung et.al. |
2503.00521 |
null |
2025-02-28 |
Visual Attention Exploration in Vision-Based Mamba Models |
Junpeng Wang et.al. |
2502.20764 |
null |
2025-02-28 |
A Quantum-Empowered SPEI Drought Forecasting Algorithm Using Spatially-Aware Mamba Network |
Po-Wei Tang et.al. |
2502.20703 |
null |
2025-02-27 |
Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners |
Daniele Paliotta et.al. |
2502.20339 |
null |
2025-02-12 |
scMamba: A Pre-Trained Model for Single-Nucleus RNA Sequencing Analysis in Neurodegenerative Disorders |
Gyutaek Oh et.al. |
2502.19429 |
null |
2025-02-26 |
EndoMamba: An Efficient Foundation Model for Endoscopic Videos |
Qingyao Tian et.al. |
2502.19090 |
link |
2025-03-05 |
A Reverse Mamba Attention Network for Pathological Liver Segmentation |
Jun Zeng et.al. |
2502.18232 |
link |
2025-02-25 |
ExPath: Towards Explaining Targeted Pathways for Biological Knowledge Bases |
Rikuto Kotoge et.al. |
2502.18026 |
null |
2025-02-28 |
Toward Foundational Model for Sleep Analysis Using a Multimodal Hybrid Self-Supervised Learning Framework |
Cheol-Hui Lee et.al. |
2502.17481 |
link |
2025-02-24 |
MDN: Mamba-Driven Dualstream Network For Medical Hyperspectral Image Segmentation |
Shijie Lin et.al. |
2502.17255 |
null |
2025-02-24 |
MambaFlow: A Novel and Flow-guided State Space Model for Scene Flow Estimation |
Jiehao Luo et.al. |
2502.16907 |
link |
2025-02-23 |
Intrinsic Model Weaknesses: How Priming Attacks Unveil Vulnerabilities in Large Language Models |
Yuyi Huang et.al. |
2502.16491 |
null |
2025-02-23 |
MAPN: Enhancing Heterogeneous Sparse Graph Representation by Mamba-based Asynchronous Aggregation |
Xuqi Mao et.al. |
2502.16454 |
null |
2025-02-28 |
SalM$^{2}$: An Extremely Lightweight Saliency Mamba Model for Real-Time Cognitive Awareness of Driver Attention |
Chunyu Zhao et.al. |
2502.16214 |
link |
2025-02-22 |
Improving Speech Enhancement by Cross- and Sub-band Processing with State Space Model |
Jizhen Li et.al. |
2502.16207 |
null |
2025-02-09 |
CacheMamba: Popularity Prediction for Mobile Edge Caching Networks via Selective State Spaces |
Ghazaleh Kianfar et.al. |
2502.15746 |
null |
2025-02-24 |
LaTIM: Measuring Latent Token-to-Token Interactions in Mamba Models |
Hugo Pitorro et.al. |
2502.15612 |
null |
2025-02-21 |
LightMamba: Efficient Mamba Acceleration on FPGA with Quantization and Hardware Co-design |
Renjie Wei et.al. |
2502.15260 |
null |
2025-02-21 |
TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba |
Xiuwei Chen et.al. |
2502.15130 |
null |
2025-02-17 |
High-Dynamic Radar Sequence Prediction for Weather Nowcasting Using Spatiotemporal Coherent Gaussian Representation |
Ziye Wang et.al. |
2502.14895 |
null |
2025-02-20 |
Multiscale Byte Language Models – A Hierarchical Architecture for Causal Million-Length Sequence Modeling |
Eric Egli et.al. |
2502.14553 |
link |
2025-02-23 |
Llamba: Scaling Distilled Recurrent Models for Efficient Language Processing |
Aviv Bick et.al. |
2502.14458 |
null |
2025-02-20 |
Topology-Aware Wavelet Mamba for Airway Structure Segmentation in Postoperative Recurrent Nasopharyngeal Carcinoma CT Scans |
Haishan Huang et.al. |
2502.14363 |
null |
2025-02-19 |
MambaLiteSR: Image Super-Resolution with Low-Rank Mamba using Knowledge Distillation |
Romina Aalishah et.al. |
2502.14090 |
null |
2025-02-19 |
Medical Image Classification with KAN-Integrated Transformers and Dilated Neighborhood Attention |
Omid Nejati Manzari et.al. |
2502.13693 |
link |
2025-02-19 |
CardiacMamba: A Multimodal RGB-RF Fusion Framework with State Space Models for Remote Physiological Measurement |
Zheng Wu et.al. |
2502.13624 |
null |
2025-03-06 |
MobileViM: A Light-weight and Dimension-independent Vision Mamba for 3D Medical Image Analysis |
Wei Dai et.al. |
2502.13524 |
link |
2025-02-19 |
SNN-Driven Multimodal Human Action Recognition via Event Camera and Skeleton Data Fusion |
Naichuan Zheng et.al. |
2502.13385 |
null |
2025-02-18 |
Benchmarking Post-Training Quantization in LLMs: Comprehensive Taxonomy, Unified Evaluation, and Comparative Analysis |
Jiaqi Zhao et.al. |
2502.13178 |
link |
2025-02-18 |
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation |
Bencheng Liao et.al. |
2502.13145 |
link |
2025-02-18 |
DAMamba: Vision State Space Model with Dynamic Adaptive Scan |
Tanzhe Li et.al. |
2502.12627 |
link |
2025-02-19 |
X-IL: Exploring the Design Space of Imitation Learning Policies |
Xiaogang Jia et.al. |
2502.12330 |
link |
2025-02-17 |
LLMs on the Line: Data Determines Loss-to-Loss Scaling Laws |
Prasanna Mayilvahanan et.al. |
2502.12120 |
null |
2025-02-17 |
S2TX: Cross-Attention Multi-Scale State-Space Transformer for Time Series Forecasting |
Zihao Wu et.al. |
2502.11340 |
null |
2025-02-16 |
RT-DEMT: A hybrid real-time acupoint detection model combining mamba and transformer |
Shilong Yang et.al. |
2502.11179 |
link |
2025-02-16 |
DAViMNet: SSMs-Based Domain Adaptive Object Detection |
A. Enes Doruk et.al. |
2502.11178 |
link |
2025-02-16 |
DuplexMamba: Enhancing Real-time Speech Conversations with Duplex and Streaming Capabilities |
Xiangyu Lu et.al. |
2502.11123 |
link |
2025-02-14 |
DeltaProduct: Increasing the Expressivity of DeltaNet Through Products of Householders |
Julien Siems et.al. |
2502.10297 |
link |
2025-02-14 |
From Markov to Laplace: How Mamba In-Context Learns Markov Chains |
Marco Bondaschi et.al. |
2502.10178 |
link |
2025-02-14 |
EmbBERT-Q: Breaking Memory Barriers in Embedded NLP |
Riccardo Bravin et.al. |
2502.10001 |
null |
2025-02-14 |
A Lightweight and Effective Image Tampering Localization Network with Vision Mamba |
Kun Guo et.al. |
2502.09941 |
link |
2025-02-19 |
MAAT: Mamba Adaptive Anomaly Transformer with association discrepancy for time series |
Abdellah Zakaria Sellam et.al. |
2502.07858 |
link |
2025-02-11 |
NARCE: A Mamba-Based Neural Algorithmic Reasoner Framework for Online Complex Event Detection |
Liying Han et.al. |
2502.07250 |
null |
2025-02-11 |
A Survey on Mamba Architecture for Vision Applications |
Fady Ibrahim et.al. |
2502.07161 |
null |
2025-02-10 |
Is Long Range Sequential Modeling Necessary For Colorectal Tumor Segmentation? |
Abhishek Srivastava et.al. |
2502.07120 |
null |
2025-02-10 |
FinMamba: Market-Aware Graph Enhanced Multi-Level Mamba for Stock Movement Prediction |
Yifan Hu et.al. |
2502.06707 |
link |
2025-02-10 |
FEMBA: Efficient and Scalable EEG Analysis with a Bidirectional Mamba Foundation Model |
Anna Tegon et.al. |
2502.06438 |
null |
2025-02-10 |
Hybrid State-Space and GRU-based Graph Tokenization Mamba for Hyperspectral Image Classification |
Muhammad Ahmad et.al. |
2502.06427 |
null |
2025-02-10 |
Multi-Level Decoupled Relational Distillation for Heterogeneous Architectures |
Yaoxin Yang et.al. |
2502.06189 |
null |
2025-02-07 |
Before It’s Too Late: A State Space Model for the Early Prediction of Misinformation and Disinformation Engagement |
Lin Tian et.al. |
2502.04655 |
link |
2025-02-07 |
The $α$-Alternator: Dynamic Adaptation To Varying Noise Levels In Sequences Using The Vendi Score For Improved Robustness and Performance |
Mohammad Reza Rezaei et.al. |
2502.04593 |
null |
2025-02-06 |
Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More |
Feng Wang et.al. |
2502.03738 |
null |
2025-02-05 |
Proxy Prompt: Endowing SAM and SAM 2 with Auto-Interactive-Prompt for Medical Segmentation |
Wang Xinyi et.al. |
2502.03501 |
null |
2025-02-04 |
On the Expressivity of Selective State-Space Layers: A Multivariate Polynomial Approach |
Edo Cohen-Karlik et.al. |
2502.02209 |
null |
2025-02-04 |
UD-Mamba: A pixel-level uncertainty-driven Mamba model for medical image segmentation |
Weiren Zhao et.al. |
2502.02024 |
link |
2025-02-04 |
DCT-Mamba3D: Spectral Decorrelation and Spatial-Spectral Feature Extraction for Hyperspectral Image Classification |
Weijia Cao et.al. |
2502.01986 |
null |
2025-02-03 |
Generalization Error Analysis for Selective State-Space Models Through the Lens of Attention |
Arya Honarpisheh et.al. |
2502.01473 |
null |
2025-02-03 |
Deep Active Speech Cancellation with Multi-Band Mamba Network |
Yehuda Mishaly et.al. |
2502.01185 |
null |
2025-02-01 |
Fast Vision Mamba: Pooling Spatial Dimensions for Accelerated Processing |
Saarthak Kapse et.al. |
2502.00594 |
null |
2025-02-01 |
Enhancing Memory and Imagination Consistency in Diffusion-based World Models via Linear-Time Sequence Modeling |
Jia-Hua Lee et.al. |
2502.00466 |
null |
2025-02-01 |
MambaGlue: Fast and Robust Local Feature Matching With Mamba |
Kihwan Ryoo et.al. |
2502.00462 |
link |
2025-02-01 |
A Study on the Performance of U-Net Modifications in Retroperitoneal Tumor Segmentation |
Moein Heidari et.al. |
2502.00314 |
link |
2025-01-30 |
HSRMamba: Contextual Spatial-Spectral State Space Model for Single Hyperspectral Super-Resolution |
Shi Chen et.al. |
2501.18500 |
link |
2025-01-31 |
MatIR: A Hybrid Mamba-Transformer Image Restoration Model |
Juan Wen et.al. |
2501.18401 |
link |
2025-01-28 |
Mamba-Shedder: Post-Transformer Compression for Efficient Selective Structured State Space Models |
J. Pablo Muñoz et.al. |
2501.17088 |
link |
2025-02-13 |
Post-Training Quantization for Vision Mamba with k-Scaled Quantization and Reparameterization |
Bo-Yun Shi et.al. |
2501.16738 |
null |
2025-01-27 |
Directing Mamba to Complex Textures: An Efficient Texture-Aware State Space Model for Image Restoration |
Long Peng et.al. |
2501.16583 |
null |
2025-01-25 |
MambaTron: Efficient Cross-Modal Point Cloud Enhancement using Aggregate Selective State Space Modeling |
Sai Tarun Inaganti et.al. |
2501.16384 |
null |
2025-01-27 |
Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity |
Weixin Liang et.al. |
2501.16295 |
link |
2025-01-27 |
Application of Structured State Space Models to High energy physics with locality-sensitive hashing |
Cheng Jiang et.al. |
2501.16237 |
null |
2025-01-26 |
Mamba-Based Graph Convolutional Networks: Tackling Over-smoothing with Selective State Space |
Xin He et.al. |
2501.15461 |
null |
2025-01-26 |
CD-Lamba: Boosting Remote Sensing Change Detection via a Cross-Temporal Locally Adaptive State Space Model |
Zhenkai Wu et.al. |
2501.15455 |
link |
2025-02-14 |
Surface Vision Mamba: Leveraging Bidirectional State Space Model for Efficient Spherical Manifold Representation |
Rongzhao He et.al. |
2501.14679 |
null |
2025-01-24 |
State Space Models for Extractive Summarization in Low Resource Scenarios |
Nisrine Ait Khayi et.al. |
2501.14673 |
null |
2025-01-31 |
A Comprehensive Framework for Semantic Similarity Analysis of Human and AI-Generated Text Using Transformer Architectures and Ensemble Techniques |
Lifu Gao et.al. |
2501.14288 |
null |
2025-01-23 |
MV-GMN: State Space Model for Multi-View Action Recognition |
Yuhui Lin et.al. |
2501.13829 |
null |
2025-02-06 |
MambaQuant: Quantizing the Mamba Family with Variance Aligned Rotation Methods |
Zukang Xu et.al. |
2501.13484 |
link |
2025-01-23 |
Contrast: A Hybrid Architecture of Transformers and State Space Models for Low-Level Vision |
Aman Urumbekov et.al. |
2501.13353 |
null |
2025-01-22 |
UniUIR: Considering Underwater Image Restoration as An All-in-One Learner |
Xu Zhang et.al. |
2501.12981 |
null |
2025-01-22 |
LiT: Delving into a Simplified Linear Diffusion Transformer for Image Generation |
Jiahao Wang et.al. |
2501.12976 |
null |
2025-01-21 |
Parallel Sequence Modeling via Generalized Spatial Propagation Network |
Hongjun Wang et.al. |
2501.12381 |
null |
2025-01-21 |
SMamba: Sparse Mamba for Event-based Object Detection |
Nan Yang et.al. |
2501.11971 |
link |
2025-01-21 |
GLAM: Global-Local Variation Awareness in Mamba-based World Model |
Qian He et.al. |
2501.11949 |
null |
2025-01-20 |
SeRpEnt: Selective Resampling for Expressive State Space Models |
Stefano Rando et.al. |
2501.11729 |
null |
2025-01-20 |
WSSM: Geographic-enhanced hierarchical state-space model for global station weather forecast |
Songru Yang et.al. |
2501.11238 |
null |
2025-01-03 |
SSM2Mel: State Space Model to Reconstruct Mel Spectrogram from the EEG |
Cunhang Fan et.al. |
2501.10402 |
null |
2025-01-16 |
WMamba: Wavelet-based Mamba for Face Forgery Detection |
Siran Peng et.al. |
2501.09617 |
null |
2025-01-15 |
MANTA: Diffusion Mamba for Efficient and Effective Stochastic Long-Term Dense Anticipation |
Olga Zatsarynna et.al. |
2501.08837 |
null |
2025-01-14 |
DM-Mamba: Dual-domain Multi-scale Mamba for MRI reconstruction |
Yucong Meng et.al. |
2501.08163 |
link |
2025-01-14 |
AVS-Mamba: Exploring Temporal and Multi-modal Mamba for Audio-Visual Segmentation |
Sitong Gong et.al. |
2501.07810 |
link |
2025-01-13 |
Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion |
Li Liang et.al. |
2501.07260 |
link |
2025-01-13 |
MSV-Mamba: A Multiscale Vision Mamba Network for Echocardiography Segmentation |
Xiaoxian Yang et.al. |
2501.07120 |
null |
2025-01-12 |
Mamba-MOC: A Multicategory Remote Object Counting via State Space Model |
Peng Liu et.al. |
2501.06697 |
null |
2025-01-10 |
MS-Temba: Multi-Scale Temporal Mamba for Efficient Temporal Action Detection |
Arkaprava Sinha et.al. |
2501.06138 |
link |
2025-01-10 |
Comparison Between Effective and Individual Fitness in a Heterogeneous Population |
Marie Doumic et.al. |
2501.05751 |
link |
2025-01-09 |
MambaHSI: Spatial-Spectral Mamba for Hyperspectral Image Classification |
Yapeng Li et.al. |
2501.04944 |
link |
2025-01-08 |
EDMB: Edge Detector with Mamba |
Yachuan Li et.al. |
2501.04846 |
link |
2025-01-08 |
H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving |
Siran Chen et.al. |
2501.04302 |
null |
2025-01-11 |
GLFC: Unified Global-Local Feature and Contrast Learning with Mamba-Enhanced UNet for Synthetic CT Generation from CBCT |
Xianhao Zhou et.al. |
2501.02992 |
link |
2025-01-08 |
Samba-ASR: State-Of-The-Art Speech Recognition Leveraging Structured State-Space Models |
Syed Abdul Gaffar Shakhadri et.al. |
2501.02832 |
null |
2025-01-05 |
KM-UNet: KAN Mamba UNet for medical image segmentation |
Yibo Zhang et.al. |
2501.02559 |
link |
2025-01-03 |
A Separable Self-attention Inspired by the State Space Model for Computer Vision |
Juntao Zhang et.al. |
2501.02040 |
link |
2025-01-03 |
Improved Feature Extraction Network for Neuro-Oriented Target Speaker Extraction |
Cunhang Fan et.al. |
2501.01673 |
null |
2025-01-03 |
Merging Context Clustering with Visual State Space Models for Medical Image Segmentation |
Yun Zhu et.al. |
2501.01618 |
link |
2025-01-02 |
A Unified Hyperparameter Optimization Pipeline for Transformer-Based Time Series Forecasting Models |
Jingjing Xu et.al. |
2501.01394 |
link |
2025-01-02 |
Detail Matters: Mamba-Inspired Joint Unfolding Network for Snapshot Spectral Compressive Imaging |
Mengjie Qin et.al. |
2501.01262 |
link |
2025-01-02 |
CryptoMamba: Leveraging State Space Models for Accurate Bitcoin Price Prediction |
Mohammad Shahab Sepehri et.al. |
2501.01010 |
link |
2025-01-01 |
HCMA-UNet: A Hybrid CNN-Mamba UNet with Inter-Slice Self-Attention for Efficient Breast Cancer Segmentation |
Haoxuan Li et.al. |
2501.00751 |
link |
2024-12-29 |
PTQ4VM: Post-Training Quantization for Visual Mamba |
Younghyun Cho et.al. |
2412.20386 |
link |
2024-12-28 |
STNMamba: Mamba-based Spatial-Temporal Normality Learning for Video Anomaly Detection |
Zhangxun Li et.al. |
2412.20084 |
null |
2024-12-28 |
MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing |
Shuo Wang et.al. |
2412.20082 |
null |
2024-12-28 |
MaIR: A Locality- and Continuity-Preserving Mamba for Image Restoration |
Boyun Li et.al. |
2412.20066 |
link |
2024-12-28 |
DepthMamba with Adaptive Fusion |
Zelin Meng et.al. |
2412.19964 |
null |
2024-12-26 |
Completion as Enhancement: A Degradation-Aware Selective Image Guided Network for Depth Completion |
Zhiqiang Yan et.al. |
2412.19225 |
null |
2024-12-26 |
BSDB-Net: Band-Split Dual-Branch Network with Selective State Spaces Mechanism for Monaural Speech Enhancement |
Cunhang Fan et.al. |
2412.19099 |
null |
2024-12-24 |
Exploring Graph Mamba: A Comprehensive Survey on State-Space Models for Graph Learning |
Safa Ben Atitallah et.al. |
2412.18322 |
null |
2024-12-24 |
U-Mamba-Net: A highly efficient Mamba-based U-net style network for noisy and reverberant speech separation |
Shaoxiang Dang et.al. |
2412.18217 |
null |
2024-12-24 |
COMO: Cross-Mamba Interaction and Offset-Guided Fusion for Multimodal Object Detection |
Chang Liu et.al. |
2412.18076 |
null |
2024-12-23 |
BrainMAP: Learning Multiple Activation Pathways in Brain Networks |
Song Wang et.al. |
2412.17404 |
link |
2024-12-23 |
VarAD: Lightweight High-Resolution Image Anomaly Detection via Visual Autoregressive Modeling |
Yunkang Cao et.al. |
2412.17263 |
link |
2024-12-22 |
Temporal-Frequency State Space Duality: An Efficient Paradigm for Speech Emotion Recognition |
Jiaqi Zhao et.al. |
2412.16904 |
null |
2025-01-10 |
ViM-Disparity: Bridging the Gap of Speed, Accuracy and Memory for Disparity Map Generation |
Maheswar Bora et.al. |
2412.16745 |
link |
2024-12-28 |
Lillama: Large Language Models Compression via Low-Rank Feature Distillation |
Yaya Sy et.al. |
2412.16719 |
null |
2024-12-21 |
From Pixels to Gigapixels: Bridging Local Inductive Bias and Long-Range Dependencies with Pixel-Mamba |
Zhongwei Qiu et.al. |
2412.16711 |
null |
2025-01-02 |
Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement |
Junyu Wang et.al. |
2412.16626 |
null |
2025-01-07 |
Trusted Mamba Contrastive Network for Multi-View Clustering |
Jian Zhu et.al. |
2412.16487 |
link |
2024-12-21 |
MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights |
Jingjing Hu et.al. |
2412.16483 |
link |
2024-12-20 |
BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models |
Patrick Haller et.al. |
2412.15978 |
null |
2024-12-20 |
Mamba-based Deep Learning Approaches for Sleep Staging on a Wireless Multimodal Wearable System without Electroencephalography |
Andrew H. Zhang et.al. |
2412.15947 |
null |
2024-12-20 |
Multi-dimensional Visual Prompt Enhanced Image Restoration via Mamba-Transformer Aggregation |
Aiwen Jiang et.al. |
2412.15845 |
link |
2024-12-20 |
Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking |
Xiantao Hu et.al. |
2412.15691 |
link |
2024-12-19 |
S$^3$-Mamba: Small-Size-Sensitive Mamba for Lesion Segmentation |
Gui Wang et.al. |
2412.14546 |
null |
2024-12-19 |
Efficient Self-Supervised Video Hashing with Selective State Spaces |
Jinpeng Wang et.al. |
2412.14518 |
link |
2024-12-18 |
State Space Models are Strong Text Rerankers |
Zhichao Xu et.al. |
2412.14354 |
null |
2024-12-18 |
MambaLCT: Boosting Tracking via Long-term Context State Space Model |
Xiaohai Li et.al. |
2412.13615 |
link |
2024-12-18 |
Robust Tracking via Mamba-based Context-aware Token Learning |
Jinxia Xie et.al. |
2412.13611 |
link |
2025-01-01 |
TAME: Temporal Audio-based Mamba for Enhanced Drone Trajectory Estimation and Classification |
Zhenyuan Xiao et.al. |
2412.13037 |
link |
2024-12-17 |
Faster Vision Mamba is Rebuilt in Minutes via Merged Token Re-training |
Mingjia Shi et.al. |
2412.12496 |
link |
2024-12-17 |
GG-SSMs: Graph-Generating State Space Models |
Nikola Zubić et.al. |
2412.12423 |
null |
2024-12-15 |
A Comparative Study on Dynamic Graph Embedding based on Mamba and Transformers |
Ashish Parmanand Pandey et.al. |
2412.11293 |
null |
2024-12-15 |
Light-T2M: A Lightweight and Fast Model for Text-to-motion Generation |
Ling-An Zeng et.al. |
2412.11193 |
link |
2024-12-15 |
OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation |
Bohan Li et.al. |
2412.11183 |
null |
2024-12-15 |
BarcodeMamba: State Space Models for Biodiversity Analysis |
Tiancheng Gao et.al. |
2412.11084 |
link |
2024-12-15 |
Exploring Enhanced Contextual Information for Video-Level Object Tracking |
Ben Kang et.al. |
2412.11023 |
link |
2024-12-14 |
MASV: Speaker Verification with Global and Local Context Mamba |
Yang Liu et.al. |
2412.10989 |
null |
2024-12-14 |
MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt |
Yuhao Wang et.al. |
2412.10707 |
link |
2024-12-13 |
XYScanNet: An Interpretable State Space Model for Perceptual Image Deblurring |
Hanzhou Liu et.al. |
2412.10338 |
null |
2024-12-13 |
SCBench: A KV Cache-Centric Analysis of Long-Context Methods |
Yucheng Li et.al. |
2412.10319 |
null |
2024-12-13 |
Selective State Space Memory for Large Vision-Language Models |
Chee Ng et.al. |
2412.09875 |
null |
2024-12-13 |
LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity |
Hongjie Wang et.al. |
2412.09856 |
null |
2024-12-12 |
Optimising TinyML with Quantization and Distillation of Transformer and Mamba Models for Indoor Localisation on Edge Devices |
Thanaphon Suwannaphong et.al. |
2412.09289 |
null |
2024-12-12 |
Selective Visual Prompting in Vision Mamba |
Yifeng Yao et.al. |
2412.08947 |
link |
2024-12-11 |
SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation |
Tapas Kumar Dutta et.al. |
2412.08482 |
link |
2024-12-11 |
LOMA: Language-assisted Semantic Occupancy Network via Triplane Mamba |
Yubo Cui et.al. |
2412.08388 |
null |
2024-12-19 |
DG-Mamba: Robust and Efficient Dynamic Graph Structure Learning with Selective State Space Models |
Haonan Yuan et.al. |
2412.08160 |
link |
2024-12-23 |
Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence |
Wenbo Huang et.al. |
2412.07481 |
null |
2024-12-10 |
Bidirectional Mamba state-space model for anomalous diffusion |
Maxime Lavaud et.al. |
2412.07299 |
null |
2024-12-10 |
MPSI: Mamba enhancement model for pixel-wise sequential interaction Image Super-Resolution |
Yuchun He et.al. |
2412.07222 |
null |
2024-12-09 |
MSCrackMamba: Leveraging Vision Mamba for Crack Detection in Fused Multispectral Imagery |
Qinfeng Zhu et.al. |
2412.06211 |
null |
2024-12-09 |
The Computational Limits of State-Space Models and Mamba via the Lens of Circuit Complexity |
Yifang Chen et.al. |
2412.06148 |
null |
2024-12-06 |
MSECG: Incorporating Mamba for Robust and Efficient ECG Super-Resolution |
Jie Lin et.al. |
2412.04861 |
null |
2024-12-03 |
Segmentation of Coronary Artery Stenosis in X-ray Angiography using Mamba Models |
Ali Rostami et.al. |
2412.02568 |
null |
2024-12-02 |
MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection |
Yonghao Dang et.al. |
2412.01422 |
null |
2024-12-02 |
MambaU-Lite: A Lightweight Model based on Mamba and Integrated Channel-Spatial Attention for Skin Lesion Segmentation |
Thi-Nhu-Quynh Nguyen et.al. |
2412.01405 |
link |
2024-12-01 |
AlignMamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignment |
Yan Li et.al. |
2412.00833 |
null |
2024-12-01 |
Learning Mamba as a Continual Learner |
Chongyang Zhao et.al. |
2412.00776 |
null |
2024-12-01 |
Decision Transformer vs. Decision Mamba: Analysing the Complexity of Sequential Decision Making in Atari Games |
Ke Yan et.al. |
2412.00725 |
link |
2024-12-01 |
2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification |
Jingwei Zhang et.al. |
2412.00678 |
link |
2024-12-01 |
MambaNUT: Nighttime UAV Tracking via Mamba and Adaptive Curriculum Learning |
You Wu et.al. |
2412.00626 |
link |
2024-11-29 |
RMIO: A Model-Based MARL Framework for Scenarios with Observation Loss in Some Agents |
Shi Zifeng et.al. |
2411.19639 |
null |
2024-11-28 |
MSEMG: Surface Electromyography Denoising with a Mamba-based Efficient Network |
Yu-Tung Liu et.al. |
2411.18902 |
link |
2024-11-27 |
Vision Mamba Distillation for Low-resolution Fine-grained Image Classification |
Yao Chen et.al. |
2411.17980 |
link |
2024-11-26 |
MTS-UNMixers: Multivariate Time Series Forecasting via Channel-Time Dual Unmixing |
Xuanbing Zhu et.al. |
2411.17770 |
link |
2024-11-26 |
TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba |
Xiaowen Ma et.al. |
2411.17473 |
link |
2024-11-26 |
On the Efficiency of NLP-Inspired Methods for Tabular Deep Learning |
Anton Frederik Thielmann et.al. |
2411.17207 |
link |
2024-11-25 |
Deformable Mamba for Wide Field of View Segmentation |
Jie Hu et.al. |
2411.16481 |
link |
2024-11-25 |
M3: Mamba-assisted Multi-Circuit Optimization via MBRL with Effective Scheduling |
Youngmin Oh et.al. |
2411.16019 |
null |
2024-11-24 |
MobileMamba: Lightweight Multi-Receptive Visual Mamba Network |
Haoyang He et.al. |
2411.15941 |
link |
2024-11-24 |
MambaTrack: Exploiting Dual-Enhancement for Night UAV Tracking |
Chunhui Zhang et.al. |
2411.15761 |
link |
2024-11-23 |
Mamba-CL: Optimizing Selective State Space Model in Null Space for Continual Learning |
De Cheng et.al. |
2411.15469 |
null |
2024-11-23 |
MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking |
Xinqi Liu et.al. |
2411.15459 |
null |
2024-11-23 |
MUFM: A Mamba-Enhanced Feedback Model for Micro Video Popularity Prediction |
Jiacheng Lu et.al. |
2411.15455 |
null |
2024-11-22 |
MambaIRv2: Attentive State Space Restoration |
Hang Guo et.al. |
2411.15269 |
link |
2024-11-22 |
OSMamba: Omnidirectional Spectral Mamba with Dual-Domain Prior Generator for Exposure Correction |
Gehui Li et.al. |
2411.15255 |
null |
2024-11-22 |
EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality |
Sanghyeok Lee et.al. |
2411.15241 |
link |
2024-11-21 |
Parameter Efficient Mamba Tuning via Projector-targeted Diagonal-centric Linear Transformation |
Seokil Ham et.al. |
2411.15224 |
null |
2024-11-21 |
BEST-STD: Bidirectional Mamba-Enhanced Speech Tokenization for Spoken Term Detection |
Anup Singh et.al. |
2411.14100 |
link |
2024-11-22 |
STREAM: A Universal State-Space Model for Sparse Geometric Data |
Mark Schöne et.al. |
2411.12603 |
null |
2024-12-06 |
Unlocking State-Tracking in Linear RNNs Through Negative Eigenvalues |
Riccardo Grazzi et.al. |
2411.12537 |
link |
2024-11-19 |
Contrast Similarity-Aware Dual-Pathway Mamba for Multivariate Time Series Node Classification |
Mingsen Du et.al. |
2411.12222 |
null |
2024-11-18 |
KAN-Mamba FusionNet: Redefining Medical Image Segmentation with Non-Linear Modeling |
Akansh Agrawal et.al. |
2411.11926 |
null |
2024-11-16 |
$\text{S}^{3}$ Mamba: Arbitrary-Scale Super-Resolution via Scaleable State Space Model |
Peizhe Xia et.al. |
2411.11906 |
null |
2024-11-18 |
Bi-Mamba: Towards Accurate 1-Bit State Space Models |
Shengkun Tang et.al. |
2411.11843 |
null |
2024-11-18 |
RAWMamba: Unified sRGB-to-RAW De-rendering With State Space Model |
Hongjun Chen et.al. |
2411.11717 |
null |
2024-11-15 |
SoftLMs: Efficient Adaptive Low-Rank Approximation of Language Models using Soft-Thresholding Mechanism |
Priyansh Bhatnagar et.al. |
2411.10543 |
null |
2024-11-15 |
M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation |
Sucheng Ren et.al. |
2411.10433 |
link |
2024-11-15 |
XLSR-Mamba: A Dual-Column Bidirectional State Space Model for Spoofing Attack Detection |
Yang Xiao et.al. |
2411.10027 |
link |
2024-11-25 |
When Mamba Meets xLSTM: An Efficient and Precise Method with the XLSTM-VMUNet Model for Skin lesion Segmentation |
Zhuoyi Fang et.al. |
2411.09363 |
link |
2024-11-13 |
Multimodal Instruction Tuning with Hybrid State Space Models |
Jianing Zhou et.al. |
2411.08840 |
null |
2024-11-13 |
MambaXCTrack: Mamba-based Tracker with SSM Cross-correlation and Motion Prompt for Ultrasound Needle Tracking |
Yuelin Zhang et.al. |
2411.08395 |
null |
2024-11-29 |
CT-Mamba: A Hybrid Convolutional State Space Model for Low-Dose CT Denoising |
Linxuan Li et.al. |
2411.07930 |
link |
2024-11-12 |
Diverse capability and scaling of diffusion and auto-regressive models when learning abstract rules |
Binxu Wang et.al. |
2411.07873 |
null |
2024-11-12 |
CDXFormer: Boosting Remote Sensing Change Detection with Extended Long Short-Term Memory |
Zhenkai Wu et.al. |
2411.07863 |
link |
2024-11-12 |
SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model |
Xinyuan Qian et.al. |
2411.07751 |
null |
2024-11-12 |
MaDiNet: Mamba Diffusion Network for SAR Target Detection |
Jie Zhou et.al. |
2411.07500 |
link |
2024-11-11 |
AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space models |
Wallace Abreu et.al. |
2411.07364 |
link |
2024-11-11 |
Mamba-based Decoder-Only Approach with Bidirectional Speech Modeling for Speech Recognition |
Yoshiki Masuyama et.al. |
2411.06968 |
link |
2024-11-11 |
LA4SR: illuminating the dark proteome with generative AI |
David R. Nelson et.al. |
2411.06798 |
null |
2024-11-11 |
LFSamba: Marry SAM with Mamba for Light Field Salient Object Detection |
Zhengyi Liu et.al. |
2411.06652 |
link |
2024-11-10 |
KMM: Key Frame Mask Mamba for Extended Motion Generation |
Zeyu Zhang et.al. |
2411.06481 |
link |
2024-11-10 |
SEM-Net: Efficient Pixel Modelling for image inpainting with Spatially Enhanced SSM |
Shuang Chen et.al. |
2411.06318 |
link |
2024-11-09 |
Selective State Space Model for Monaural Speech Enhancement |
Moran Chen et.al. |
2411.06217 |
null |
2024-11-06 |
DiMSUM: Diffusion Mamba – A Scalable and Unified Spatial-Frequency Method for Image Generation |
Hao Phung et.al. |
2411.04168 |
link |
2024-11-06 |
Can Custom Models Learn In-Context? An Exploration of Hybrid Architecture Performance on In-Context Learning Tasks |
Ryan Campbell et.al. |
2411.03945 |
link |
2024-11-06 |
MambaPEFT: Exploring Parameter-Efficient Fine-Tuning for Mamba |
Masakazu Yoshimura et.al. |
2411.03855 |
link |
2024-11-06 |
Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model |
Yansong Qu et.al. |
2411.03672 |
null |
2024-11-05 |
ShadowMamba: State-Space Model with Boundary-Region Selective Scan for Shadow Removal |
Xiujin Zhu et.al. |
2411.03260 |
null |
2024-11-05 |
A Mamba Foundation Model for Time Series Forecasting |
Haoyu Ma et.al. |
2411.02941 |
null |
2024-11-12 |
LE-PDE++: Mamba for accelerating PDEs Simulations |
Aoming Liang et.al. |
2411.01897 |
null |
2024-11-03 |
Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation |
Zhenbin Wang et.al. |
2411.01647 |
null |
2024-11-21 |
BiT-MamSleep: Bidirectional Temporal Mamba for EEG Sleep Staging |
Xinliang Zhou et.al. |
2411.01589 |
null |
2024-11-03 |
MambaReg: Mamba-Based Disentangled Convolutional Sparse Coding for Unsupervised Deformable Multi-Modal Image Registration |
Kaiang Wen et.al. |
2411.01399 |
null |
2024-11-05 |
A versatile framework for attitude tuning of beamlines at advanced light sources |
Peng-Cheng Li et.al. |
2411.01278 |
null |
2024-11-05 |
Beyond the EPICS: comprehensive Python IOC development with QueueIOC |
Peng-Cheng Li et.al. |
2411.01258 |
null |
2024-10-31 |
SambaMixer: State of Health Prediction of Li-ion Batteries using Mamba State Space Models |
José Ignacio Olalde-Verano et.al. |
2411.00233 |
link |
2024-10-31 |
NIMBA: Towards Robust and Principled Processing of Point Clouds With SSMs |
Nursena Köprücü et.al. |
2411.00151 |
null |
2024-11-01 |
Dynamical similarity analysis uniquely captures how computations develop in RNNs |
Quentin Guilhot et.al. |
2410.24070 |
link |