journal
Journals IEEE Transactions on Image Pro...

IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society

https://read.qxmd.com/read/38625760/multi-relational-deep-hashing-for-cross-modal-search
#1
JOURNAL ARTICLE
Xiao Liang, Erkun Yang, Yanhua Yang, Cheng Deng
Deep cross-modal hashing retrieval has recently made significant progress. However, existing methods generally learn hash functions with pairwise or triplet supervisions, which involves learning the relevant information by splicing partial similarity between data pairs; notably, this approach only captures the data similarity locally and incompletely, resulting in sub-optimal retrieval performance. In this paper, we propose a novel Multi-Relational Deep Hashing (MRDH) approach, which can fully bridge the modality gap by comprehensively modeling the similarity relationship between data in different modalities...
April 16, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
https://read.qxmd.com/read/38619939/glpanodepth-global-to-local-panoramic-depth-estimation
#2
JOURNAL ARTICLE
Jiayang Bai, Haoyu Qin, Shuichang Lai, Jie Guo, Yanwen Guo
Depth estimation is a fundamental task in many vision applications. With the popularity of omnidirectional cameras, it becomes a new trend to tackle this problem in the spherical space. In this paper, we propose a learning-based method for predicting dense depth values of a scene from a monocular omnidirectional image. An omnidirectional image has a full field-of-view, providing much more complete descriptions of the scene than perspective images. However, fully-convolutional networks that most current solutions rely on fail to capture rich global contexts from the panorama...
April 15, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
https://read.qxmd.com/read/38607703/saliency-guided-deep-neural-network-for-color-transfer-with-light-optimization
#3
JOURNAL ARTICLE
Yuming Fang, Pengwei Yuan, Chenlei Lv, Chen Peng, Jiebin Yan, Weisi Lin
Color transfer aims to change the color information of the target image according to the reference one. Many studies propose color transfer methods by analysis of color distribution and semantic relevance, which do not take the perceptual characteristics for visual quality into consideration. In this study, we propose a novel color transfer method based on the saliency information with brightness optimization. First, a saliency detection module is designed to separate the foreground regions from the background regions for images...
April 12, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
https://read.qxmd.com/read/38607702/deep-variation-prior-joint-image-denoising-and-noise-variance-estimation-without-clean-data
#4
JOURNAL ARTICLE
Rihuan Ke
With recent deep learning based approaches showing promising results in removing noise from images, the best denoising performance has been reported in a supervised learning setup that requires a large set of paired noisy images and ground truth data for training. The strong data requirement can be mitigated by unsupervised learning techniques, however, accurate modelling of images or noise variances is still crucial for high-quality solutions. The learning problem is ill-posed for unknown noise distributions...
April 12, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
https://read.qxmd.com/read/38607701/istr-mask-embedding-based-instance-segmentation-transformer
#5
JOURNAL ARTICLE
Jie Hu, Yao Lu, Shengchuan Zhang, Liujuan Cao
Transformer-based instance-level recognition has attracted increasing research attention recently due to the superior performance. However, although attempts have been made to encode masks as embeddings into Transformer-based frameworks, how to combine mask embeddings and spatial information for a transformer-based approach is still not fully explored. In this paper, we revisit the design of mask-embedding-based pipelines and propose an Instance Segmentation TRansformer (ISTR) with Mask Meta-Embeddings (MME), leveraging the strengths of transformer models in encoding embedding information and incorporating spatial information from mask embeddings...
April 12, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
https://read.qxmd.com/read/38598375/refqsr-reference-based-quantization-for-image-super-resolution-networks
#6
JOURNAL ARTICLE
Hongjae Lee, Jun-Sang Yoo, Seung-Won Jung
Single image super-resolution (SISR) aims to reconstruct a high-resolution image from its low-resolution observation. Recent deep learning-based SISR models show high performance at the expense of increased computational costs, limiting their use in resource-constrained environments. As a promising solution for computationally efficient network design, network quantization has been extensively studied. However, existing quantization methods developed for SISR have yet to effectively exploit image self-similarity, which is a new direction for exploration in this study...
April 10, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
https://read.qxmd.com/read/38598374/high-quality-and-diverse-few-shot-image-generation-via-masked-discrimination
#7
JOURNAL ARTICLE
Jingyuan Zhu, Huimin Ma, Jiansheng Chen, Jian Yuan
Few-shot image generation aims to generate images of high quality and great diversity with limited data. However, it is difficult for modern GANs to avoid overfitting when trained on only a few images. The discriminator can easily remember all the training samples and guide the generator to replicate them, leading to severe diversity degradation. Several methods have been proposed to relieve overfitting by adapting GANs pre-trained on large source domains to target domains using limited real samples. This work presents masked discrimination to realize few-shot GAN adaptation, which is the first feature-level augmentation method for generative tasks...
April 10, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
https://read.qxmd.com/read/38598373/nonconvex-robust-high-order-tensor-completion-using-randomized-low-rank-approximation
#8
JOURNAL ARTICLE
Wenjin Qin, Hailin Wang, Feng Zhang, Weijun Ma, Jianjun Wang, Tingwen Huang
Within the tensor singular value decomposition (T-SVD) framework, existing robust low-rank tensor completion approaches have made great achievements in various areas of science and engineering. Nevertheless, these methods involve the T-SVD based low-rank approximation, which suffers from high computational costs when dealing with large-scale tensor data. Moreover, most of them are only applicable to third-order tensors. Against these issues, in this article, two efficient low-rank tensor approximation approaches fusing random projection techniques are first devised under the order-d (d ≥ 3) T-SVD framework...
April 10, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
https://read.qxmd.com/read/38598372/single-stage-adaptive-multi-attention-network-for-image-restoration
#9
JOURNAL ARTICLE
Anas Zafar, Danyal Aftab, Rizwan Qureshi, Xinqi Fan, Pingjun Chen, Jia Wu, Hazrat Ali, Shah Nawaz, Sheheryar Khan, Mubarak Shah
Recently attention-based networks have been successful for image restoration tasks. However, existing methods are either computationally expensive or have limited receptive fields, adding constraints to the model. They are also less resilient in spatial and contextual aspects and lack pixel-to-pixel correspondence, which may degrade feature representations. In this paper, we propose a novel and computationally efficient architecture Single Stage Adaptive Multi-Attention Network (SSAMAN) for image restoration tasks, particularly for image denoising and image deblurring...
April 10, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
https://read.qxmd.com/read/38593019/source-guided-target-feature-reconstruction-for-cross-domain-classification-and-detection
#10
JOURNAL ARTICLE
Yifan Jiao, Hantao Yao, Bing-Kun Bao, Changsheng Xu
Existing cross-domain classification and detection methods usually apply a consistency constraint between the target sample and its self-augmentation for unsupervised learning without considering the essential source knowledge. In this paper, we propose a Source-guided Target Feature Reconstruction (STFR) module for cross-domain visual tasks, which applies source visual words to reconstruct the target features. Since the reconstructed target features contain the source knowledge, they can be treated as a bridge to connect the source and target domains...
April 9, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
https://read.qxmd.com/read/38587950/relationship-incremental-scene-graph-generation-by-a-divide-and-conquer-pipeline-with-feature-adapter
#11
JOURNAL ARTICLE
Xuewei Li, Guangcong Zheng, Yunlong Yu, Naye Ji, Xi Li
As a challenging computer vision task, Scene Graph Generation (SGG) finds the latent semantic relationships among objects from a given image, which may be limited by the datasets and real-world scenarios. In this paper, we consider a novel incremental learning task called Relationship-Incremental Scene Graph Generation (RISGG) that learns the semantic relationships among objects in an incremental way. Compared with classic Class-Incremental Learning (CIL) problem, RISGG suffers from its special issues: (1) Old class shift - the relationship-labeled object pair may have different labels during different learning sessions...
April 8, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
https://read.qxmd.com/read/38578860/generalizing-to-out-of-sample-degradations-via-model-reprogramming
#12
JOURNAL ARTICLE
Runhua Jiang, Yahong Han
Existing image restoration models are typically designed for specific tasks and struggle to generalize to out-of-sample degradations not encountered during training. While zero-shot methods can address this limitation by fine-tuning model parameters on testing samples, their effectiveness relies on predefined natural priors and physical models of specific degradations. Nevertheless, determining out-of-sample degradations faced in real-world scenarios is always impractical. As a result, it is more desirable to train restoration models with inherent generalization ability...
April 5, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
https://read.qxmd.com/read/38578859/driftrec-adapting-diffusion-models-to-blind-jpeg-restoration
#13
JOURNAL ARTICLE
Simon Welker, Henry N Chapman, Timo Gerkmann
In this work, we utilize the high-fidelity generation abilities of diffusion models to solve blind JPEG restoration at high compression levels. We propose an elegant modification of the forward stochastic differential equation of diffusion models to adapt them to this restoration task and name our method DriftRec. Comparing DriftRec against an L2 regression baseline with the same network architecture and state-of-the-art techniques for JPEG restoration, we show that our approach can escape the tendency of other methods to generate blurry images, and recovers the distribution of clean images significantly more faithfully...
April 5, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
https://read.qxmd.com/read/38578858/shared-manifold-regularized-joint-feature-selection-for-joint-classification-and-regression-in-alzheimer-s-disease-diagnosis
#14
JOURNAL ARTICLE
Zhi Chen, Yongguo Liu, Yun Zhang, Jiajing Zhu, Qiaoqin Li, Xindong Wu
In Alzheimer's disease (AD) diagnosis, joint feature selection for predicting disease labels (classification) and estimating cognitive scores (regression) with neuroimaging data has received increasing attention. In this paper, we propose a model named Shared Manifold regularized Joint Feature Selection (SMJFS) that performs classification and regression in a unified framework for AD diagnosis. For classification, unlike the existing works that build least squares regression models which are insufficient in the ability of extracting discriminative information for classification, we design an objective function that integrates linear discriminant analysis and subspace sparsity regularization for acquiring an informative feature subset...
April 4, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
https://read.qxmd.com/read/38557629/hierarchical-perceptual-noise-injection-for-social-media-fingerprint-privacy-protection
#15
JOURNAL ARTICLE
Simin Li, Huangxinxin Xu, Jiakai Wang, Ruixiao Xu, Aishan Liu, Fazhi He, Xianglong Liu, Dacheng Tao
Billions of people share images from their daily lives on social media every day. However, their biometric information (e.g., fingerprints) could be easily stolen from these images. The threat of fingerprint leakage from social media has created a strong desire to anonymize shared images while maintaining image quality, since fingerprints act as a lifelong individual biometric password. To guard the fingerprint leakage, adversarial attack that involves adding imperceptible perturbations to fingerprint images have emerged as a feasible solution...
April 1, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
https://read.qxmd.com/read/38557628/orthogonal-spatial-binary-coding-method-for-high-speed-3d-measurement
#16
JOURNAL ARTICLE
Haitao Wu, Yiping Cao, Yongbo Dai, Zhimi Wei
Temporal phase unwrapping based on single auxiliary binary coded pattern has been proven to be effective for high-speed 3D measurement. However, in traditional spatial binary coding, it often leads to an imbalance between the number of periodic divisions and codewords. To meet this challenge, a large codewords orthogonal spatial binary coding method is proposed in this paper. By expanding spatial multiplexing from 1D to 2D orthogonal direction, it goes beyond the traditional 8 codewords to 27 codewords at three-level periodic division...
April 1, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
https://read.qxmd.com/read/38551828/anomaly-detection-for-medical-images-using-heterogeneous-auto-encoder
#17
JOURNAL ARTICLE
Shuai Lu, Weihang Zhang, He Zhao, Hanruo Liu, Ningli Wang, Huiqi Li
Anomaly detection is an important task for medical image analysis, which can alleviate the reliance of supervised methods on large labelled datasets. Most existing methods use a pixel-wise self-reconstruction framework for anomaly detection. However, there are two challenges of these studies: 1) they tend to overfit learning an identity mapping between the input and output, which leads to failure in detecting abnormal samples; 2) the reconstruction considers the pixel-wise differences which may lead to an undesirable result...
March 29, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
https://read.qxmd.com/read/38551827/region-aware-video-object-segmentation-with-deep-motion-modeling
#18
JOURNAL ARTICLE
Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Ajmal Mian
Current semi-supervised video object segmentation (VOS) methods often employ the entire features of one frame to predict object masks and update memory. This introduces significant redundant computations. To reduce redundancy, we introduce a Region Aware Video Object Segmentation (RAVOS) approach, which predicts regions of interest (ROIs) for efficient object segmentation and memory storage. RAVOS includes a fast object motion tracker to predict object ROIs in the next frame. For efficient segmentation, object features are extracted based on the ROIs, and an object decoder is designed for object-level segmentation...
March 29, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
https://read.qxmd.com/read/38546994/knowledge-augmented-visual-question-answering-with-natural-language-explanation
#19
JOURNAL ARTICLE
Jiayuan Xie, Yi Cai, Jiali Chen, Ruohang Xu, Jiexin Wang, Qing Li
Visual question answering with natural language explanation (VQA-NLE) is a challenging task that requires models to not only generate accurate answers but also to provide explanations that justify the relevant decision-making processes. This task is accomplished by generating natural language sentences based on the given question-image pair. However, existing methods often struggle to ensure consistency between the answers and explanations due to their disregard of the crucial interactions between these factors...
March 28, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
https://read.qxmd.com/read/38546993/robust-fine-grained-visual-recognition-with-neighbor-attention-label-correction
#20
JOURNAL ARTICLE
Shunan Mao, Shiliang Zhang
Existing deep learning methods for fine-grained visual recognition often rely on large-scale, well-annotated training data. Obtaining fine-grained annotations in the wild typically requires concentration and expertise, such as fine category annotation for species recognition, instance annotation for person re-identification (re-id) and dense annotation for segmentation, which inevitably leads to label noise. This paper aims to tackle label noise in deep model training for fine-grained visual recognition. We propose a Neighbor-Attention Label Correction (NALC) model to correct labels during the training stage...
March 28, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society
journal
journal
34155
1
2
Fetch more papers »
Fetching more papers... Fetching...
Remove bar
Read by QxMD icon Read
×

Save your favorite articles in one place with a free QxMD account.

×

Search Tips

Use Boolean operators: AND/OR

diabetic AND foot
diabetes OR diabetic

Exclude a word using the 'minus' sign

Virchow -triad

Use Parentheses

water AND (cup OR glass)

Add an asterisk (*) at end of a word to include word stems

Neuro* will search for Neurology, Neuroscientist, Neurological, and so on

Use quotes to search for an exact phrase

"primary prevention of cancer"
(heart or cardiac or cardio*) AND arrest -"American Heart Association"

We want to hear from doctors like you!

Take a second to answer a survey question.