CryoTEN: Efficiently Enhancing Cryo-EM Density Maps Using Transformers

Academic Background Cryogenic Electron Microscopy (Cryo-EM) is a crucial experimental technique for determining the structures of macromolecules such as proteins. However, the effectiveness of Cryo-EM is often hindered by noise and missing density values caused by experimental conditions such as low contrast and conformational heterogeneity. Althou...

DRTN: Dual Relation Transformer Network with Feature Erasure and Contrastive Learning for Multi-Label Image Classification

New Breakthrough in Multi-Label Image Classification: Dual Relation Transformer Network Academic Background Multi-Label Image Classification (MLIC) is a fundamental yet highly challenging problem in the field of computer vision. Unlike single-label image classification, MLIC aims to assign multiple labels to objects within a single image. Due to th...

Learning with Enriched Inductive Biases for Vision-Language Models

Learning with Enriched Inductive Biases for Vision-Language Models Research Background and Problem Statement In recent years, Vision-Language Models (VLMs) have made significant progress in the fields of computer vision and natural language processing. These models are pre-trained on large-scale image-text pairs to construct a unified multimodal re...

A Mutual Supervision Framework for Referring Expression Segmentation and Generation

A Mutual Supervision Framework for Referring Expression Segmentation and Generation

A Mutual Supervision Framework for Referring Expression Segmentation and Generation Research Background and Problem Statement In recent years, vision-language interaction technology has made remarkable progress in the field of artificial intelligence. Among these advancements, referring expression segmentation (RES) and referring expression generat...

An End-to-End Visual Semantic Localization Network Using Multi-View Images

A Study on End-to-End Visual Semantic Localization Using Multi-View Images Background and Research Significance With the rapid development of intelligent driving technology, precise localization of autonomous vehicles has become a hot topic in research and industry. Accurate vehicle localization is not only a core module of autonomous driving but a...

Transformer for Object Re-Identification: A Survey

Background and Significance Object re-identification (Re-ID) is an essential task in computer vision aimed at identifying specific objects across different times and scenes. Driven by deep learning, particularly convolutional neural networks (CNNs), this field has made significant strides. However, the emergence of vision transformers has opened ne...

A Transformer-Based Approach Combining Deep Learning Network and Spatial-Temporal Information for Raw EEG Classification

Research Background and Objectives In recent years, Brain-Computer Interface (BCI) systems have been widely applied in the fields of neuroengineering and neuroscience. Electroencephalogram (EEG), as a data tool reflecting the activities of different neuronal groups in the central nervous system, has become a core research topic in these fields. How...