Towards Transparent Deep Image Aesthetics Assessment with Tag-based Content Descriptors

Towards Transparent Deep Image Aesthetics Assessment with Tag-based Content Descriptors

Transparent Deep Image Aesthetic Assessment Based on Tag Content Descriptions Academic Background With the proliferation of social media platforms such as Instagram and Flickr, there is an increasing demand for Image Aesthetics Assessment (IAA) models. These models can help social network service providers optimize image ranking or recommendation r...

Balancing Feature Alignment and Uniformity for Few-Shot Classification

Balancing Feature Alignment and Uniformity for Few-Shot Classification

Solving Few-Shot Classification Problems with Balanced Feature Alignment and Uniformity Background and Motivation The goal of Few-Shot Learning (FSL) is to correctly recognize new samples with only a few examples from new classes. Existing few-shot learning methods mainly learn transferable knowledge from base classes by maximizing the information ...

Negative Deterministic Information-Based Multiple Instance Learning for Weakly Supervised Object Detection and Segmentation

Negative Deterministic Information-Based Multiple Instance Learning for Weakly Supervised Object Detection and Segmentation

Negative Deterministic Information-Based Multiple Instance Learning for Weakly Supervised Object Detection and Segmentation Background Introduction In the past decade, significant progress has been made in the field of computer vision, particularly in object detection and semantic segmentation. However, most of the designed algorithms and models he...

Advancing Hyperspectral and Multispectral Image Fusion: An Information-Aware Transformer-Based Unfolding Network

Advancing Hyperspectral and Multispectral Image Fusion: An Information-Aware Transformer-Based Unfolding Network

Information-aware Transformer Unfolding Network for Hyperspectral and Multispectral Image Fusion Background Introduction Hyperspectral images (HSIs) play a crucial role in remote sensing applications such as material identification, image classification, target detection, and environmental monitoring, due to their spectral information across multip...

A Graph-Neural-Network-Powered Solver Framework for Graph Optimization Problems

A Graph-Neural-Network-Powered Solver Framework for Graph Optimization Problems

A Framework for Solving Graph Optimization Problems Based on Graph Neural Networks Background and Research Motivation In solving Constraint Satisfaction Problems (CSPs) and Combinatorial Optimization Problems (COPs), a common method is the combination of backtracking and branch heuristics. Although branch heuristics designed for specific problems a...

Weakly Supervised Semantic Segmentation via Alternate Self-Dual Teaching

Weakly Supervised Semantic Segmentation via Alternate Self-Dual Teaching

Weakly Supervised Semantic Image Segmentation via Alternate Self-Dual Teaching Background Introduction With the continuous development of the computer vision field, semantic segmentation has become an important and active research direction. Traditional semantic segmentation methods rely on manually labeled pixel-level tags; however, obtaining thes...

Robust Multiobjective Reinforcement Learning Considering Environmental Uncertainties

Background In recent years, Reinforcement Learning (RL) has demonstrated its effectiveness in solving various complex tasks. However, many real-world decision-making and control problems involve multiple conflicting objectives. The relative importance (preference) of these objectives often needs to be balanced against each other in different scenar...

Modulating Effective Receptive Fields for Convolutional Kernels

GMConv: Adjusting the Effective Receptive Field of Convolutional Neural Networks Introduction Convolutional Neural Networks (CNNs) have achieved significant success in computer vision tasks, including image classification and object detection, through the use of convolutional kernels. However, in recent years, Vision Transformers (ViTs) have gained...

A bilingual speech neuroprosthesis driven by cortical articulatory representations shared between languages

Bilingual Speech Neuroprosthesis Driven by Cortical Speech Representations Background In the development of neuroprostheses, research on decoding language from brain activity has primarily focused on decoding a single language. Thus, the extent to which bilingual speech production relies on unique or shared cortical activity between different langu...

Strokeclassifier: Ischemic Stroke Etiology Classification by Ensemble Consensus Modeling Using Electronic Health Records

StrokeClassifier: An AI Tool for Etiological Classification of Ischemic Stroke Based on Electronic Health Records Project Background and Motivation Identifying the etiology of strokes, particularly acute ischemic stroke (AIS), is crucial for secondary prevention, but it is often very challenging. In the United States, there are nearly 676,000 new c...