MDD-Net: Multimodal Depression Detection through Mutual Transformer Paper • 2508.08093 • Published 12 days ago
MMFformer: Multimodal Fusion Transformer Network for Depression Detection Paper • 2508.06701 • Published 15 days ago
BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset Paper • 2303.05325 • Published Mar 9, 2023
GNN-ViTCap: GNN-Enhanced Multiple Instance Learning with Vision Transformers for Whole Slide Image Classification and Captioning Paper • 2507.07006 • Published Jul 9
FusionEnsemble-Net: An Attention-Based Ensemble of Spatiotemporal Networks for Multimodal Sign Language Recognition Paper • 2508.09362 • Published 11 days ago
A Signer-Invariant Conformer and Multi-Scale Fusion Transformer for Continuous Sign Language Recognition Paper • 2508.09372 • Published 11 days ago
SAP-CoPE: Social-Aware Planning using Cooperative Pose Estimation with Infrastructure Sensor Nodes Paper • 2504.05727 • Published Apr 8
An Efficient Approach to Generate Safe Drivable Space by LiDAR-Camera-HDmap Fusion Paper • 2410.22314 • Published Oct 29, 2024
Enhancing Indoor Mobility with Connected Sensor Nodes: A Real-Time, Delay-Aware Cooperative Perception Approach Paper • 2411.02624 • Published Nov 4, 2024
CoInfra: A Large-Scale Cooperative Infrastructure Perception System and Dataset in Adverse Weather Paper • 2507.02245 • Published Jul 3 • 1
StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs Paper • 2505.20139 • Published May 26 • 18
Bench-NPIN: Benchmarking Non-prehensile Interactive Navigation Paper • 2505.12084 • Published May 17 • 2
LookAhead: Preventing DeFi Attacks via Unveiling Adversarial Contracts Paper • 2401.07261 • Published Jan 14, 2024
DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models Paper • 2403.00818 • Published Feb 26, 2024 • 20
Simple Disentanglement of Style and Content in Visual Representations Paper • 2302.09795 • Published Feb 20, 2023 • 1