Most recent papers in the journal IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society

#1

JOURNAL ARTICLE

Multi-Relational Deep Hashing for Cross-Modal Search.

Xiao Liang, Erkun Yang, Yanhua Yang, Cheng Deng

Deep cross-modal hashing retrieval has recently made significant progress. However, existing methods generally learn hash functions with pairwise or triplet supervisions, which involves learning the relevant information by splicing partial similarity between data pairs; notably, this approach only captures the data similarity locally and incompletely, resulting in sub-optimal retrieval performance. In this paper, we propose a novel Multi-Relational Deep Hashing (MRDH) approach, which can fully bridge the modality gap by comprehensively modeling the similarity relationship between data in different modalities...

38625760

April 16, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#2

JOURNAL ARTICLE

GLPanoDepth: Global-to-Local Panoramic Depth Estimation.

Jiayang Bai, Haoyu Qin, Shuichang Lai, Jie Guo, Yanwen Guo

Depth estimation is a fundamental task in many vision applications. With the popularity of omnidirectional cameras, it becomes a new trend to tackle this problem in the spherical space. In this paper, we propose a learning-based method for predicting dense depth values of a scene from a monocular omnidirectional image. An omnidirectional image has a full field-of-view, providing much more complete descriptions of the scene than perspective images. However, fully-convolutional networks that most current solutions rely on fail to capture rich global contexts from the panorama...

38619939

April 15, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#3

JOURNAL ARTICLE

Saliency Guided Deep Neural Network for Color Transfer with Light Optimization.

Yuming Fang, Pengwei Yuan, Chenlei Lv, Chen Peng, Jiebin Yan, Weisi Lin

Color transfer aims to change the color information of the target image according to the reference one. Many studies propose color transfer methods by analysis of color distribution and semantic relevance, which do not take the perceptual characteristics for visual quality into consideration. In this study, we propose a novel color transfer method based on the saliency information with brightness optimization. First, a saliency detection module is designed to separate the foreground regions from the background regions for images...

38607703

April 12, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#4

JOURNAL ARTICLE

Deep Variation Prior: Joint Image Denoising and Noise Variance Estimation without Clean Data.

Rihuan Ke

With recent deep learning based approaches showing promising results in removing noise from images, the best denoising performance has been reported in a supervised learning setup that requires a large set of paired noisy images and ground truth data for training. The strong data requirement can be mitigated by unsupervised learning techniques, however, accurate modelling of images or noise variances is still crucial for high-quality solutions. The learning problem is ill-posed for unknown noise distributions...

38607702

April 12, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#5

JOURNAL ARTICLE

ISTR: Mask-Embedding-Based Instance Segmentation Transformer.

Jie Hu, Yao Lu, Shengchuan Zhang, Liujuan Cao

Transformer-based instance-level recognition has attracted increasing research attention recently due to the superior performance. However, although attempts have been made to encode masks as embeddings into Transformer-based frameworks, how to combine mask embeddings and spatial information for a transformer-based approach is still not fully explored. In this paper, we revisit the design of mask-embedding-based pipelines and propose an Instance Segmentation TRansformer (ISTR) with Mask Meta-Embeddings (MME), leveraging the strengths of transformer models in encoding embedding information and incorporating spatial information from mask embeddings...

38607701

April 12, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#6

JOURNAL ARTICLE

RefQSR: Reference-based Quantization for Image Super-Resolution Networks.

Hongjae Lee, Jun-Sang Yoo, Seung-Won Jung

Single image super-resolution (SISR) aims to reconstruct a high-resolution image from its low-resolution observation. Recent deep learning-based SISR models show high performance at the expense of increased computational costs, limiting their use in resource-constrained environments. As a promising solution for computationally efficient network design, network quantization has been extensively studied. However, existing quantization methods developed for SISR have yet to effectively exploit image self-similarity, which is a new direction for exploration in this study...

38598375

April 10, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#7

JOURNAL ARTICLE

High-quality and Diverse Few-shot Image Generation via Masked Discrimination.

Jingyuan Zhu, Huimin Ma, Jiansheng Chen, Jian Yuan

Few-shot image generation aims to generate images of high quality and great diversity with limited data. However, it is difficult for modern GANs to avoid overfitting when trained on only a few images. The discriminator can easily remember all the training samples and guide the generator to replicate them, leading to severe diversity degradation. Several methods have been proposed to relieve overfitting by adapting GANs pre-trained on large source domains to target domains using limited real samples. This work presents masked discrimination to realize few-shot GAN adaptation, which is the first feature-level augmentation method for generative tasks...

38598374

April 10, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#8

JOURNAL ARTICLE

Nonconvex Robust High-Order Tensor Completion Using Randomized Low-Rank Approximation.

Wenjin Qin, Hailin Wang, Feng Zhang, Weijun Ma, Jianjun Wang, Tingwen Huang

Within the tensor singular value decomposition (T-SVD) framework, existing robust low-rank tensor completion approaches have made great achievements in various areas of science and engineering. Nevertheless, these methods involve the T-SVD based low-rank approximation, which suffers from high computational costs when dealing with large-scale tensor data. Moreover, most of them are only applicable to third-order tensors. Against these issues, in this article, two efficient low-rank tensor approximation approaches fusing random projection techniques are first devised under the order-d (d ≥ 3) T-SVD framework...

38598373

April 10, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#9

JOURNAL ARTICLE

Single Stage Adaptive Multi-Attention Network for Image Restoration.

Anas Zafar, Danyal Aftab, Rizwan Qureshi, Xinqi Fan, Pingjun Chen, Jia Wu, Hazrat Ali, Shah Nawaz, Sheheryar Khan, Mubarak Shah

Recently attention-based networks have been successful for image restoration tasks. However, existing methods are either computationally expensive or have limited receptive fields, adding constraints to the model. They are also less resilient in spatial and contextual aspects and lack pixel-to-pixel correspondence, which may degrade feature representations. In this paper, we propose a novel and computationally efficient architecture Single Stage Adaptive Multi-Attention Network (SSAMAN) for image restoration tasks, particularly for image denoising and image deblurring...

38598372

April 10, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#10

JOURNAL ARTICLE

Source-guided Target Feature Reconstruction for Cross-domain Classification and Detection.

Yifan Jiao, Hantao Yao, Bing-Kun Bao, Changsheng Xu

Existing cross-domain classification and detection methods usually apply a consistency constraint between the target sample and its self-augmentation for unsupervised learning without considering the essential source knowledge. In this paper, we propose a Source-guided Target Feature Reconstruction (STFR) module for cross-domain visual tasks, which applies source visual words to reconstruct the target features. Since the reconstructed target features contain the source knowledge, they can be treated as a bridge to connect the source and target domains...

38593019

April 9, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#11

JOURNAL ARTICLE

Relationship-Incremental Scene Graph Generation by a Divide-and-Conquer Pipeline with Feature Adapter.

Xuewei Li, Guangcong Zheng, Yunlong Yu, Naye Ji, Xi Li

As a challenging computer vision task, Scene Graph Generation (SGG) finds the latent semantic relationships among objects from a given image, which may be limited by the datasets and real-world scenarios. In this paper, we consider a novel incremental learning task called Relationship-Incremental Scene Graph Generation (RISGG) that learns the semantic relationships among objects in an incremental way. Compared with classic Class-Incremental Learning (CIL) problem, RISGG suffers from its special issues: (1) Old class shift - the relationship-labeled object pair may have different labels during different learning sessions...

38587950

April 8, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#12

JOURNAL ARTICLE

Generalizing to Out-of-Sample Degradations via Model Reprogramming.

Runhua Jiang, Yahong Han

Existing image restoration models are typically designed for specific tasks and struggle to generalize to out-of-sample degradations not encountered during training. While zero-shot methods can address this limitation by fine-tuning model parameters on testing samples, their effectiveness relies on predefined natural priors and physical models of specific degradations. Nevertheless, determining out-of-sample degradations faced in real-world scenarios is always impractical. As a result, it is more desirable to train restoration models with inherent generalization ability...

38578860

April 5, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#13

JOURNAL ARTICLE

DriftRec: Adapting diffusion models to blind JPEG restoration.

Simon Welker, Henry N Chapman, Timo Gerkmann

In this work, we utilize the high-fidelity generation abilities of diffusion models to solve blind JPEG restoration at high compression levels. We propose an elegant modification of the forward stochastic differential equation of diffusion models to adapt them to this restoration task and name our method DriftRec. Comparing DriftRec against an L2 regression baseline with the same network architecture and state-of-the-art techniques for JPEG restoration, we show that our approach can escape the tendency of other methods to generate blurry images, and recovers the distribution of clean images significantly more faithfully...

38578859

April 5, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#14

JOURNAL ARTICLE

Shared Manifold Regularized Joint Feature Selection for Joint Classification and Regression in Alzheimer's Disease Diagnosis.

Zhi Chen, Yongguo Liu, Yun Zhang, Jiajing Zhu, Qiaoqin Li, Xindong Wu

In Alzheimer's disease (AD) diagnosis, joint feature selection for predicting disease labels (classification) and estimating cognitive scores (regression) with neuroimaging data has received increasing attention. In this paper, we propose a model named Shared Manifold regularized Joint Feature Selection (SMJFS) that performs classification and regression in a unified framework for AD diagnosis. For classification, unlike the existing works that build least squares regression models which are insufficient in the ability of extracting discriminative information for classification, we design an objective function that integrates linear discriminant analysis and subspace sparsity regularization for acquiring an informative feature subset...

38578858

April 4, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#15

JOURNAL ARTICLE

Hierarchical Perceptual Noise Injection for Social Media Fingerprint Privacy Protection.

Simin Li, Huangxinxin Xu, Jiakai Wang, Ruixiao Xu, Aishan Liu, Fazhi He, Xianglong Liu, Dacheng Tao

Billions of people share images from their daily lives on social media every day. However, their biometric information (e.g., fingerprints) could be easily stolen from these images. The threat of fingerprint leakage from social media has created a strong desire to anonymize shared images while maintaining image quality, since fingerprints act as a lifelong individual biometric password. To guard the fingerprint leakage, adversarial attack that involves adding imperceptible perturbations to fingerprint images have emerged as a feasible solution...

38557629

April 1, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#16

JOURNAL ARTICLE

Orthogonal spatial binary coding method for high-speed 3D measurement.

Haitao Wu, Yiping Cao, Yongbo Dai, Zhimi Wei

Temporal phase unwrapping based on single auxiliary binary coded pattern has been proven to be effective for high-speed 3D measurement. However, in traditional spatial binary coding, it often leads to an imbalance between the number of periodic divisions and codewords. To meet this challenge, a large codewords orthogonal spatial binary coding method is proposed in this paper. By expanding spatial multiplexing from 1D to 2D orthogonal direction, it goes beyond the traditional 8 codewords to 27 codewords at three-level periodic division...

38557628

April 1, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#17

JOURNAL ARTICLE

Anomaly Detection for Medical Images using Heterogeneous Auto-Encoder.

Shuai Lu, Weihang Zhang, He Zhao, Hanruo Liu, Ningli Wang, Huiqi Li

Anomaly detection is an important task for medical image analysis, which can alleviate the reliance of supervised methods on large labelled datasets. Most existing methods use a pixel-wise self-reconstruction framework for anomaly detection. However, there are two challenges of these studies: 1) they tend to overfit learning an identity mapping between the input and output, which leads to failure in detecting abnormal samples; 2) the reconstruction considers the pixel-wise differences which may lead to an undesirable result...

38551828

March 29, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#18

JOURNAL ARTICLE

Region Aware Video Object Segmentation with Deep Motion Modeling.

Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Ajmal Mian

Current semi-supervised video object segmentation (VOS) methods often employ the entire features of one frame to predict object masks and update memory. This introduces significant redundant computations. To reduce redundancy, we introduce a Region Aware Video Object Segmentation (RAVOS) approach, which predicts regions of interest (ROIs) for efficient object segmentation and memory storage. RAVOS includes a fast object motion tracker to predict object ROIs in the next frame. For efficient segmentation, object features are extracted based on the ROIs, and an object decoder is designed for object-level segmentation...

38551827

March 29, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#19

JOURNAL ARTICLE

Knowledge-Augmented Visual Question Answering with Natural Language Explanation.

Jiayuan Xie, Yi Cai, Jiali Chen, Ruohang Xu, Jiexin Wang, Qing Li

Visual question answering with natural language explanation (VQA-NLE) is a challenging task that requires models to not only generate accurate answers but also to provide explanations that justify the relevant decision-making processes. This task is accomplished by generating natural language sentences based on the given question-image pair. However, existing methods often struggle to ensure consistency between the answers and explanations due to their disregard of the crucial interactions between these factors...

38546994

March 28, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

#20

JOURNAL ARTICLE

Robust Fine-Grained Visual Recognition with Neighbor-Attention Label Correction.

Shunan Mao, Shiliang Zhang

Existing deep learning methods for fine-grained visual recognition often rely on large-scale, well-annotated training data. Obtaining fine-grained annotations in the wild typically requires concentration and expertise, such as fine category annotation for species recognition, instance annotation for person re-identification (re-id) and dense annotation for segmentation, which inevitably leads to label noise. This paper aims to tackle label noise in deep model training for fine-grained visual recognition. We propose a Neighbor-Attention Label Correction (NALC) model to correct labels during the training stage...

38546993

March 28, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society

Use the journals feature with a free QxMD account.

IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society

Save your favorite articles in one place with a free QxMD account.

Read

Search Tips