O1 |
Image/Video Enhancement I |
Time |
|
Chair |
Maggie Zhu (Purdue University) |
ID |
Title |
Author |
88 |
FGF-GAN:
A Lightweight Generative Adversarial Network For Pansharpening Via Fast
Guided Filter |
Zixiang
Zhao (Xi’an Jiaotong University)*; Jiangshe Zhang (Xi'an Jiaotong
University); Shuang Xu (Xi'an Jiaotong University); Kai Sun (Xi'an Jiaotong
University); Lu Huang (Xi’an Jiaotong University); Junmin Liu (Xi'an Jiaotong
University); Chunxia Zhang (Xi'an Jiaotong University) |
253 |
Collaborative
Reflectance-and-Illumination Learning for High-Efficient Low-light Image
Enhancement |
Guijing
Zhu (Dalian University of Technology); Long Ma (Dalian University of
Technology); Risheng Liu (Dalian University of Technology)*; Xin Fan (Dalian
University of Technology); Zhongxuan Luo (DALIAN UNIVERSITY OF TECHNOLOGY) |
308 |
Organ-Branched-CNN
for Robust Face Super-Resolution |
Jichun
Li (Fudan University); Bahetiyaer Bare (Fudan University); Shili Zhou (Fudan
University); Bo Yan (Fudan University)*; Ke Li (Fudan University) |
350 |
Learning
Long-Term Style Preserving Blind Video Temporal Consistency |
Hugo
Thimonier (L'Oréal Research and Innovation)*; Julien Despois (L’Oréal
Research and Innovation); Robin Kips (L'Oréal Research and Innovation);
Matthieu Perrot (L’Oréal Research and Innovation) |
441 |
ISTA-Net++:
Flexible Deep Unfolding Network for Compressive Sensing |
Di
You (Peking University); Jingfen Xie (Peking University); Jian Zhang (Peking
University Shenzhen Graduate School)* |
456 |
Spatial
Graph Convolutional Network for Image Super-Resolution |
Yue
Yang (Xi’an Jiaotong University)*; Yong Qi (Xi’an Jiaotong University) |
|
|
|
O2 |
Cross-modal and multi-modal multimedia analysis |
Time |
|
Chair |
Bihan Wen (Nanyang Technological University) |
ID |
Title |
Author |
41 |
HIERARCHICAL
REPRESENTATION NETWORK WITH AUXILIARY TASKS FOR VIDEO CAPTIONING |
Yu
Lei (University of Electronic Science and Technology of China); Zhonghai He
(UESTC)*; Pengpeng Zeng (University of Electronic Science and Technology of
China); Jingkuan Song (UESTC); Lianli Gao (The University of Electronic
Science and Technology of China) |
115 |
Label-specific
Alignment with Adversarial Multi-view Representation |
Yi
Zhang (Nanjing University)*; Jundong Shen (Nanjing University); Cheng Yu
(Nanjing University); Chongjun Wang (Nanjing University) |
214 |
Weakly-supervised
Audio-visual Sound Source Detection and Separation |
Tanzila
Rahman (University of British Columbia)*; Leonid Sigal (University of
British Columbia) |
799 |
Combine
Early and Late Fusion Together: A Hybrid Fusion Framework for Image-Text
Matching |
Yifan
Wang (University of Electronic Science and Technology of China); Xing Xu
(University of Electronic Science and Technology of China)*; Wei Yu
(University of Electronic Science and Technology of China); Ruicong Xu
(MEITUAN); Zuo Cao (MEITUAN); Heng Tao Shen (University of Electronic Science
and Technology of China (UESTC)) |
1137 |
Tensor-based
Multi-view Block-diagonal Structure Diffusion for Clustering Incomplete
Multi-view Data |
Zhenglai
Li (China University of Geosciences); Chang Tang (China University of
Geosciences)*; Xinwang Liu (National University of Defense Technology); Xiao
Zheng (National University of Defense Technology); Wei Zhang (Qilu University
of Technology); En Zhu (National University of Defense Technology) |
1389 |
Multi-Dimensional
Attentive Hierarchical Graph Pooling Network for Video-Text Retrieval |
Dehao
Wu (Peking University Shenzhen Graduate School)*; Yi Li (Peking University
Shenzhen Graduate School); Yinghong Zhang (Peking University Shenzhen
Graduate School); Yuesheng Zhu (Peking University Shenzhen Graduate School) |
|
|
|
O3 |
Emerging applications of artificial
intelligence |
Time |
|
Chair |
Zhang Wei (Singapore Institute of Technology) |
ID |
Title |
Author |
566 |
Class
Forge: Boosting Feature Encoder for Few-shot Learning with Synthesized
Classes |
Rui-Qi
Wang (Institute of Automation, Chinese Academy of Sciences)*; Xu-Yao Zhang
(Institute of Automation of Chinese Academy of Sciences); Cheng-Lin Liu
(Institute of Automation of Chinese Academy of Sciences) |
568 |
GSS:
Graph-based Subspace Learning with Shots Initialization for Few-shot
Recognition |
Rui-Qi
Wang (Institute of Automation, Chinese Academy of Sciences)*; Xu-Yao Zhang
(Institute of Automation of Chinese Academy of Sciences); Cheng-Lin Liu
(Institute of Automation of Chinese Academy of Sciences) |
688 |
Truth
Inference with Bipartite Attention Graph Neural Network from a Comprehensive
View |
Jiacheng
Liu (Shanghai Jiao Tong University); Feilong Tang (Shanghai Jiao Tong
University)*; Jielong Huang (Alibaba Group) |
714 |
Calibration
for Non-exemplar based Class-incremental Learning |
Fei
Zhu (Institute of Automation of Chinese Academy of Sciences)*; Xu-Yao Zhang
(Institute of Automation of Chinese Academy of Sciences); Cheng-Lin Liu
(Institute of Automation of Chinese Academy of Sciences) |
746 |
Revisiting
Graph Neural Networks for Node Classification in Heterogeneous Graphs |
Ye
Tao (Peking University)*; Ying Li (Peking University); Zhonghai Wu (Peking
University) |
759 |
DDPER:
Decentralized Distributed Prioritized Experience Replay |
Sidun
Liu (NUDT); Peng Qiao (NUDT)*; Yong Dou (National University of Defense
Technology); Rongchun Li (National Laboratory for Parallel and Distributed
Processing, National University of Defense Technology, Changsha, Hunan) |
|
|
|
O4 |
Multimedia databases and data mining |
Time |
|
Chair |
Yueqi Duan (Stanford University) |
ID |
Title |
Author |
370 |
HAZY
RE-ID: AN INTERFERENCE SUPPRESSION MODEL FOR DOMAIN ADAPTATION PERSON
RE-IDENTIFICATION UNDER INCLEMENT WEATHER CONDITION |
Jian
Pang (China University of Petroleum (East China)); Dacheng Zhang (Kunming
University of Science and Technology); Huafeng Li (Kunming University of
Science and Technology)*; Weifeng Liu (China University of Petroleum (East
China)); Zhengtao Yu (Kunming University of Science and Technology) |
440 |
Adaptive
Deep Metric Ensemble Learning with Consensus |
Ping
Li (Hangzhou Dianzi University)*; Guopan Zhao (Hangzhou Dianzi University);
Huaxin Xiao (National University of Defense Technology) |
682 |
Weakly-Supervised
Online Hashing |
Yu-Wei
Zhan (Shandong University); Xin
Luo (Shandong University)*; Yu Sun (Shandong University); Yongxin Wang
(Shandong University); Zhen-Duo Chen (Shandong University); Xin-Shun Xu
(Shandong University) |
761 |
Deep
Unsupervised Hashing by Distilled Smooth Guidance |
Xiao
Luo (Peking University); Zeyu Ma (Harbin Institute of Technology, Shenzhen);
Daqing Wu (Peking University); Huasong Zhong (Alibaba); Chong Chen (Alibaba);
Jinwen Ma (Peking University); Minghua Deng (Peking University)* |
647 |
Tensor-based
Unsupervised Multi-view Feature Selection for Image Recognition |
Yongshan
Zhang (China University of Geosciences)*; Xinxin Wang (China University of
Geosciences); Zhihua Cai (China University of Geosciences); Yicong Zhou
(University of Macau); Philip S Yu (UNIVERSITY OF ILLINOIS AT CHICAGO) |
1129 |
Supervised
Video Summarization via Multiple Feature Sets with Parallel Attention |
Junaid
Ahmed Ghauri (TIB - Leibniz Information Centre for Science and Technology)*;
Sherzod Hakimov (TIB - Leibniz Information Centre for Science and
Technology); Ralph Ewerth (TIB - Leibniz Information Center for Science and
Technology) |
|
|
|
O5 |
Speech/audio synthesis and coding |
Time |
|
Chair |
Jahangir Alam (Computer Research Institute of
Montreal) |
ID |
Title |
Author |
451 |
CROSS-DOMAIN
SINGLE-CHANNEL SPEECH ENHANCEMENT MODEL WITH BI-PROJECTION FUSION MODULE FOR
NOISE-ROBUST ASR |
Fu-An
Chao (National Taiwan Normal University)*; Jeih-weih Hung (National Chi Nan
University); Berlin Chen (National Taiwan Normal University) |
79 |
FastSVC:
Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear
Modulation |
Songxiang
LIU (The Chinese University of Hong Kong)*; Yuewen Cao (CUHK); Na Hu
(Tencent); Dan Su (Tencent); Helen Meng (The Chinese University of Hong Kong) |
709 |
Spatial
audio object coding based on time-frequency shifting and scheduling |
Chenhao
Hu (Wuhan University); Ruimin Hu (Wuhan University)*; Xiaochen Wang (Wuhan
University); Yulin Wu (Wuhan University) |
711 |
LOW
BITRATES AUDIO OBJECT CODING USING CONVOLUTIONAL AUTO-ENCODER AND DENSENET
MIXTURE MODEL |
Yulin
Wu (Wuhan University); Ruimin Hu (Wuhan University)*; Chenhao Hu (Wuhan
University); Shanfa Ke (Wuhan University); Gang Li (Wuhan University);
Xiaochen Wang (Wuhan University) |
1022 |
Efficient
multi-step audio object coding with limited residual information |
Chenhao
Hu (Wuhan University); Ruimin Hu (Wuhan University)*; Xiaochen Wang (Wuhan
University); Yulin Wu (Wuhan University); Wenke Liu (Wuhan University) |
964 |
Deep
Speaker Conditioning for Speech Emotion Recognition |
Andreas
Triantafyllopoulos (audEERING GmbH / University of Augsburg)*; Shuo Liu
(University of Augsburg); Björn Schuller (University of Augsburg) |
|
|
|
O6 |
Special
Session: Deep Learning for Multimedia Applications with Limited Supervision |
Time |
|
Chair |
Joey
Tianyi Zhou (National University of Singapore) |
ID |
Title |
Author |
107 |
Near
Real Feature Generative Network for Generalized Zero-Shot Learning |
Jingren
Liu (Nanjing University of Science and Technology); Haoyue Bai (Nanjing
University of Science and Technology); Haofeng Zhang (Nanjing University of
Science and Technology)*; Li Liu (Inception Institute of Artificial Intelligence) |
124 |
Saliency-Guided
Complementary Attention for Improved Few-Shot Learning |
Linglan
Zhao (Shanghai Jiao Tong University)*; Ge Liu (Shanghai Jiao Tong
University); Da-shan Guo (Shanghai Jiao Tong University); Wei Li (Shanghai
Jiao Tong University); Xiangzhong Fang (Shanghai Jiao Tong University) |
271 |
Unsupervised
Video Person Re-identification via Noise and Hard frame Aware Clustering |
Pengyu
Xie (Wuhan University of Science and Technology); Xin Xu (Wuhan University of
Science and Technology)*; Zheng Wang (The University of Tokyo); Toshihiko
Yamasaki (The University of Tokyo) |
298 |
Dual-regularization
Complementary Learning for Image Classification |
Lingjuan
Ge (Wuhan University); Mingming Gong (University of Melbourne); Yutian Lin
(Wuhan University)*; Bo Du (Wuhan University) |
411 |
Multi-domain
Synchronous Refinement Network for Unsupervised Cross-Domain Person
Re-Identification |
Sikai
Bai (Northwestern Polytechnical University); Junyu Gao (Northwestern
Polytechnical University, Center for OPTical IMagery Analysis and Learning);
Qi Wang (Northwestern
Polytechnical University)*; Xuelong Li (Northwestern Polytechnical
University) |
675 |
Few-Shot
Defect Segmentation Leveraging Abundant Defect-free Training Samples Through
Normal Background Regularization and Crop-and-Paste Operation |
Dongyun
Lin (Institute for Infocomm Research)*; Yanpeng Cao (ZJU); Wenbin Zhu
(Zhejiang University); Yiqun Li (Institute for Infocomm Research) |
|
|
|
O7 |
Multimedia activity analysis and
understanding |
Time |
|
Chair |
Zhiyong
Wang (The University of Sydney) |
ID |
Title |
Author |
80 |
Relationship-aware
Primal-Dual Graph Attention Network for Scene Graph Generation |
Hao
Zhou (National University of Defense Technology); Tingjin Luo (College of
Liberal Arts and Sciences, National University of Defense Technology)*; Jun
Zhang (Science and Technology on Information Systems Engineering Laboratory,
National University of Defense Technology); Jun Lei (National University of
Defense Technology); Shuohao LI (College of Information System and
Management, National University of Defense Technology) |
100 |
PAL-Net:
Predicate-Aware Learning Network for Visual Relationship Recognition |
Liang
Xu (Shanghai Jiao Tong University); Yong-Lu Li (Shanghai Jiao Tong
University); Mingyang Chen (Shanghai Jiao Tong University); Yan Hao (Shanghai
Jiao Tong University); Cewu Lu (Shanghai Jiao Tong University)* |
215 |
DIVING
INTO THE RELATIONS: LEVERAGING SEMANTIC AND VISUAL STRUCTURES FOR VIDEO
MOMENT RETRIEVAL |
Ziyue
Wu (Student)*; Junyu Gao (CASIA); Shucheng Huang (Jiangsu University of
Science and Technology); Changsheng Xu (CASIA) |
563 |
Multimodal-Semantic
Context-Aware Graph Neural Network for Group Activity Recognition |
Tianshan
Liu (The Hong Kong Polytechnic University)*; Rui Zhao (The Hong Kong
Polytechnic University); Kin-Man Lam (The Hong Kong Polytechnic University) |
676 |
Temporally
Coarse to Fine Snippets Relationship Learning with Graph Convolution for
Temporal Action Proposal Generation |
Shuaicheng
Li (Fudan University)*; Rui-Wei Zhao (Fudan University); Shuyu Miao (Fudan
University); Rui Feng (Fudan University) |
906 |
Recurrent
Graph Convolutional Autoencoder for Unsupervised Skeleton-Based Action
Recognition |
Han
Yao (Tongji University); S-J Zhao (HaiBa Technology)*; Chi Xie (Tongji
University); Kenan Ye (Tongji University); Shuang Liang (Tongji University) |
|
|
|
O8 |
Image/Video
Enhancement II |
Time |
|
Chair |
Bihan
Wen (Nanyang Technological University) |
ID |
Title |
Author |
741 |
Structure-Resonant
Discriminator for Image Super-Resolution |
Jaerin
Lee (Seoul National University)*; Kyoung Mu Lee (Seoul National University) |
846 |
Asymmetric
Stereo Color Transfer |
Yicheng
Wang (University of Science and Technology of China); Jiayong Peng
(University of Science and Technology of China); Yueyi Zhang (University of
Science and Technology of China); Shan Liu (Tencent America); Xiaoyan Sun
(University of Science and Technology of China); Zhiwei Xiong (University of
Science and Technology of China)* |
878 |
Residual
Attention Block Search for Lightweight Image Super-Resolution |
Wenrui
Liao (HFUT); Zhong-Qiu Zhao (HFUT)*; Hao Shen (HFUT); Weidong Tian (HFUT) |
893 |
HALDeR:
Hierarchical Attention-guided Learning with Detail-refinement for
Multi-Exposure Image Fusion |
Jinyuan
Liu (Dalian University of Technology); JingJie Shang (Dalian University of
Technology); Risheng Liu (Dalian University of Technology); Xin Fan (Dalian
University of Technology)* |
1020 |
Deep
Deblocker Driven Adaptive Iteration Scheme for Compressed Image Recovery |
Chao
Ren (Sichuan University)*; Xiaohai He (Sichuan University); Linbo Qing
(Sichuan University, China); Yuanzhouhan Cao (Beijing Jiaotong University) |
1094 |
Structure-Oriented
Progressive Low-rank Image
Restoration for Defending Adversarial Attacks |
Zhiqun
Zhao (University of Missouri-Columbia); Hengyou Wang (Beijing University of
Civil Engineering and Architecture); HAO SUN (University of
Missouri-Columbia); Wenming Cao (Shenzhen University); Zhihai He (University
of Missouri Columbia)* |
|
|
|
O9 |
Multimedia representation learning |
Time |
|
Chair |
Wei-Ta
Chu (National Cheng Kung University) |
ID |
Title |
Author |
9 |
Fine-Grained
Image Retrieval via Multiple Part-level Feature Ensemble |
Gang
Cao (Shenzhen University); Yingying Zhu (Shenzhen University)*; Xiufan Lu
(Shenzhen University) |
290 |
Cross-View
Equivariant Auto-Encoder |
Zhibin
Wan (School of Intelligence and Computing, Tianjin University); Changqing
Zhang (Tianjin University)*; Yu Geng (Tianjin University); Huazhu Fu
(Inception Institute of Artificial Intelligence); Xi Peng (College of
Computer Science, Sichuan University); Pengfei Zhu (Tianjin University);
Qinghua Hu (Tianjin University) |
469 |
Noise
Homogenization via Multi-Channel Wavelet Filtering for High-Fidelity Sample
Generation in GANs |
Shaoning
Zeng (Yangtze Delta Region Institute (Hu Zhou), University of Electronic
Science and Technology of China)*; Bob Zhang (University of Macau) |
471 |
Semantically-Guided Disentangled
Representation for Robust Gait Recognition |
Tianrui
Chai (Beihang University)*; Xinyu Mei (Beihang University); Annan Li (Beijing
University of Aeronautics and Astronautics); Yunhong Wang (State Key
Laboratory of Virtual Reality Technology and System, Beihang University,
Beijing 100191, China) |
480 |
Self-Guided
Deep Multi-view Subspace Clustering Network |
Beilei
Cui (Dalian University of Technology); Hong Yu (Dalian University of
Technology)*; Linlin Zong (Dalian University of Technology); Ziyang Cheng
(Dalian University of Technology) |
624 |
Efficient
Sketch Recognition via Compact Spatial Embedding Graph Neural Networks |
Hanhui
Li (Nanyang Technological University)*; Xudong Jiang (Nanyang Technological
University); Boliang Guan (Sun Yat-sen University); Nadia Magnenat Thalmann (Nanyang
Technological University) |
|
|
|
O10 |
3D
stereo computing |
Time |
|
Chair |
Shuai
Li (Shandong University) |
ID |
Title |
Author |
127 |
Disparity
Estimation with Scene Depth Cues |
Lei
Chen (Tsinghua University)*; Zongqing Lu (Tsinghua University International
Graduate School at Shenzhen); Qingmin Liao (Tsinghua University); Haoyu Ma
(Tsinghua University); Jing-Hao Xue (University College London) |
225 |
Learning
Depth from Single Image using Depth-Aware Convolution and Stereo Knowledge |
Zhenyao
Wu (University of South Carolina)*; Xinyi Wu (University of South Carolina);
Xiaoping Zhang (Wuhan University); Song Wang (University of South Carolina);
Lili Ju (University of South Carolina) |
295 |
Fast
Multi-Scale Residual Fusion Network for Stereo Matching |
Zijing
Huang (Peking University Shenzhen Graduate School); Jun Peng (Peking
University Shenzhen Graduate School); Wangduo Xie (Peking University Shenzhen
Graduate School); Qiuping Li (Peking University Shenzhen Graduate School);
Yong Zhao (Peking University Shenzhen Graduate School)* |
399 |
TAG-Reg:
Iterative Accurate Global Registration Algorithm |
Biao
Li (Xi'an Jiaotong University); Qixing Xie (Xi'an Jiaotong University);
Shaoyi Du (Xi'an Jiaotong University)*; Wenting Cui (Xi'an Jiaotong
University); Runzhao Yao (Xi'an Jiaotong University); Yue Gao (Tsinghua
University); Nanning Zheng (Institute of Artificial Intelligence and
Robotics, Xi'an Jiaotong University) |
780 |
Better
stereo matching from simple yet effective wrangling of deep features |
Lei
Chen (Tsinghua University)*; Zongqing Lu (Tsinghua University International
Graduate School at Shenzhen); Qingmin Liao (Tsinghua University); Jing-Hao
Xue (University College London) |
1352 |
AUTOMATIC
CHECKERBOARD DETECTION FOR ROBUST CAMERA CALIBRATION |
Ben
Chen (Huazhong University of Science and Technology; Alibaba Group)*; Yuyao
Liu (Huazhong University of Science and Technology); Caihua Xiong (School of
Mechanical Science and Engineering, Huazhong University of Science and
Technology) |
|
|
|
O11 |
Multimedia
for society and health |
Time |
|
Chair |
Liping
Chen (Microsoft) |
ID |
Title |
Author |
1053 |
Sample
Efficient Lung Segmentation using Group structured Conditional Variational
Data Imputation |
Yan
Li (East China Normal University); Guitao Cao (East China Normal
University)*; Wenming Cao (Shenzhen University) |
261 |
Integrating
Performance and Side Factors into Embeddings for Deep Learning-Based
Knowledge Tracing |
Liangliang
He (National University of Defense Technology)* |
857 |
unsupervised
domain adaptation based image synthesis and synergistic adversarial learning
for optic disc and cup segmentation |
Weixin
Liu (Shenzhen University); Haijun
Lei (Shenzhen University);
Hai Xie (Shenzhen University); Benjian Zhao (Shenzhen University); Baiying
Lei (Shenzhen University)* |
65 |
Let's
Find Fluorescein: Cross-Modal Dual Attention Learning for Fluorescein Leakage
Segmentation in Fundus Fluorescein Angiography |
Yang
Wen (School of Computer Science and Engineering, University of Electronic
Science and Technology of China); Leiting Chen (School of Computer Science
and Engineering, University of Electronic Science and Technology of China);
Lifeng Qiao (University of Electronic Science and Technology of China); Yu
Deng (King's College London); Haisheng Chen (University of Electronic Science
and Technology of China); Tian Zhang (School of Computer Science and
Engineering, University of Electronic Science and Technology of China); Chuan
Zhou (School of Computer Science and Engineering, University of Electronic
Science and Technology of China)* |
704 |
Shape-Adaptive
Convolutional Operator for Breast Ultrasound Image Segmentation |
Kuan
Huang (Utah State University); Yingtao Zhang (Harbin Institute of
Technology); H. D. Cheng (Utah State University)*; Ping Xing (First
Affiliated Hospital of Harbin Medical University) |
941 |
Bias
Field Poses a Threat to DNN-based X-Ray Recognition |
Binyu
Tian (Tianjin University); Qing Guo (Nanyang Technological University)*;
Felix Juefei-Xu (Alibaba Group, USA); Wen Le Chan (Nanyang Technological
University); Yupeng Cheng (Nanyang Technological University, Singapore);
Xiaohong Li (Tianjin University);
Xiaofei Xie (Nanyang Technological University); Shengchao Qin (Teesside
University) |
|
|
|
O12 |
Special
Session: Advanced Video Coding and Deep Active Learning |
Time |
|
Chair |
Hui
Yuan (Shandong University) |
ID |
Title |
Author |
710 |
SPLIT
UNIT CODING ORDER FOR VIDEO CODING |
Yinji
Piao (Samsung Electronics)*; Kiho Choi (Gachon University); Min Woo Park
(Samsung Electronics); Minsoo Park (Samsung Electronics); Kwang Pyo Choi
(Samsung Electronics) |
787 |
IMPROVED
CHROMA FROM LUMA PREDICTION IN AV1 BASED ON VIRTUAL CHROMA BLOCK GENERATION |
Junyan
Huo (Xidian University)*; Menglin Zhang (Xidian University); Wenhan Qiao
(Xidian University); FuZheng Yang (Xidian University); Hui Su (Google
Inc.); Debargha Mukherjee (Google
Inc.) |
904 |
ANGULAR
WEIGHTED PREDICTION FOR NEXT-GENERATION VIDEO CODING STANDARD |
Yucheng
Sun (Hikvision Research Institute); Fangdong Chen (Hikvision Research
Institute); Li Wang (Hikvision Research Institute); Shiliang Pu (Hikvision
Research Institute)* |
223 |
Meta-Learning
Causal Feature Selection for Stable Prediction |
Zhaoquan
Yuan (School of Computing and Artificial Intelligence, Southwest Jiaotong
University); Xiao Peng (Southwest Jiaotong University); Xiao Wu (Southwest
Jiaotong University)*; Bingkun Bao (Nanjing University of Posts and
Telecommunications); Changsheng Xu (CASIA) |
1244 |
Application
of Leading Indicator Forecasting based on Optimal Transmission in Financial
Technology |
Tao
Yin (Shanghai Jiao Tong University); Zhexi Zhang (Shanghai Jiao Tong
University); Nianchi Zhang (East China Normal University); Ning Zhang
(Shanghai Jiao Tong University)* |
1541 |
Multi-scale
Enhanced Active Learning for Skeleton-based Action Recognition |
Yuhan
Zhang (University of Electronic Science and Technology of China)*; Zhiyu Zhao
(Nanjing University); Wen Li (University of Electronic Science and Technology
of China); Lixin Duan (University of Electronic Science and Technology of
China) |
|
|
|
O13 |
Emerging
multimedia applications |
Time |
|
Chair |
Zheng
Wang (The University of Tokyo) |
ID |
Title |
Author |
257 |
Capturing
Implicit Spatial Cues for Monocular 3D Hand Reconstruction |
Qi
Wu (Institute of Intelligent Machines, Chinese Academy of Sciences); Joya Chen
(University of Science and Technology of China); Zhou Xu (Hefei Institutes of
Physical Science, Chinese Academy of Sciences); ZhiMing Yao (Hefei Institutes of
Physical Science, Chinese Academy of Sciences); Xianjun Yang (Hefei
Institutes of Physical Science, Chinese Academy of Sciences)* |
1186 |
Efficient
and Accurate Hypergraph Matching |
Jian
Hou (Dongguan University of Technology)*; Huaqiang Yuan (Dongguan University
of Technology) |
801 |
Zero-shot
Multi-Focus Image Fusion |
Xingyu
Hu (Harbin Institute of Technology)*; Junjun Jiang (Harbin Institute of
Technology); Xianming Liu (Harbin Institute of Technology); Jiayi Ma (Wuhan
University) |
1187 |
Attentive
Update of Multi-Critic for Deep Reinforcement Learning |
Qing
Li (USTC)*; Wengang Zhou
(University of Science and Technology of China); Yun Zhou (University of
Science and Technology of China); Houqiang Li (University of Science and
Technology of China) |
1347 |
Small
object recognition using a spatio-temporal neural network |
Zhibo
Liang (Harbin Institute of Technology)*; Shaohui Liu (Harbin Institute of
Technology); Wuzhen Shi (Shenzhen University); Xingtao Wang (Harbin Institute
of Technology; Peng Cheng Laboratory); Feng Jiang (Harbin Institute of
Technology, Harbin) |
1565 |
Person
Retrieval with Conv-Transformer |
Shengsen
Wu (Peking University)*; YAN BAI (Peking University); Ce Wang (Peking
University); Lingyu Duan (Peking University) |
|
|
|
O14 |
Multimedia semantic segmentation |
Time |
|
Chair |
Duc
Thanh Nguyen (Deakin University) |
ID |
Title |
Author |
642 |
MULTI-SCALE
FEEDBACK FEATURE REFINEMENT U-NET FOR MEDICAL IMAGE SEGMENTATION |
Xiaofei
Qin (University of Shanghai for Science and Technology); Minmin Xu (University of
Shanghai for Science and Technology); Chaoyang Zheng (University of Shanghai
for Science and Technology); Changxiang He (University of Shanghai for
Science and Technology); Xuedian
Zhang (University of Shanghai for Science and Technology)* |
898 |
Document
Layout Analysis via Dynamic Residual Feature Fusion |
Xingjiao
Wu (East China Normal University); ZiLing Hu (East China Normal University);
Xiangcheng Du (East China Normal University); Jing Yang (ECNU)*; Liang He
(ECNU) |
1109 |
SEMI-SUPERVISED
SEMANTIC SEGMENTATION VIA ENTROPY MINIMIZATION |
Jiawei
Wu (Fujian Agriculture and Forestry University); Haoyi Fan (Harbin University
of Science and Technology); Xiaoqing Zhang (Minjiang University); Shouying
Lin (Fujian Agriculture and Forestry University); Zuoyong Li (Minjiang University)* |
1184 |
EFRNET:
A LIGHTWEIGHT NETWORK WITH EFFICIENT FEATURE FUSION AND REFINEMENT FOR
REAL-TIME SEMANTIC SEGMENTATION |
Kuayue
Zhang (Tsinghua University); Qingmin Liao (Tsinghua University); Juncheng
Zhang (Tsinghua University); Shaojun Liu (Hong Kong University of Science and
Technology)*; Haoyu Ma (Tsinghua University); Jing-Hao Xue (University
College London) |
1205 |
Weakly-Supervised
Attribute Segmentation |
Guangzhen
Liu (Renmin University of China); Zhiwu Lu (Renmin University of China)* |
1518 |
CONFIDENCE-GUIDED
ADAPTIVE GATE AND DUAL DIFFERENTIAL ENHANCEMENT FOR VIDEO SALIENT OBJECT
DETECTION |
Pei-Jia
Chen (Sun Yat-sen University); Jian-Huang Lai (Sun Yat-sen University)*;
Guangcong Wang (Sun Yat-Sen University); Huajun Zhou (Sun Yat-sen University) |
|
|
|
O15 |
Image/Video Synthesis and Creation I |
Time |
|
Chair |
Tsung-Wei
Huang (Dolby Labs) |
ID |
Title |
Author |
45 |
Semantic-Aware
Video Color Style Transfer based on Temporal Consistent Sparse Patch
Constraint |
Yaxin
Liu (College of Computer Science and Software Engineering, Shenzhen
University); Xiaoyan Zhang (College of Computer Science and Software
Engineering, Shenzhen University)*; Xiaogang XU (The Chinese University of
Hong Kong) |
119 |
Learnable
Sampling 3D Convolution for video enhancement and action recognition |
Shuyang
Gu (University of Science and Technology of China)*; Jianmin Bao (Microsoft
Research Asia); Dong Chen (Microsoft Research Asia) |
137 |
ASTM:
An Attention based SpatioTemporal Model for Video Prediction Using 3D
Convolutional Neural Networks |
Zheng
Chang (University of Chinese Academy of Sciences)*; Xinfeng Zhang
(University of Chinese Academy of Sciences); Shanshe Wang (Peking
University); Siwei Ma (Peking University, China); Yan Ye (Alibaba Inc.); Wen
Gao (PKU) |
191 |
Adversarial
Adaptive Interpolation for Regularizing Representation Learning and Image
Synthesis in AutoEncoders |
Guanyue
Li (SCUT); Xiwen Wei (South China University of Technology); Sheng Qian
(Huawei Device Company Limited); Si Wu (South China University of
Technology)*; Zhiwen Yu (South China University of Technology); Hau San Wong
(City University of Hong Kong) |
220 |
Real-time
Masked Face Revealing for Video Conference |
Jinpeng
Lin (Xiamen University); Pengfei Liu (School of Informatics, Xiamen
University); Yinglin Zheng (School of Informatics, Xiamen University); Wenjin
Deng (School of Informatics, Xiamen University); Ming Zeng (School of
Informatics, Xiamen University)* |
245 |
LI-NET:
LARGE-POSE IDENTITY-PRESERVING FACE REENACTMENT NETWORK |
Jin
Liu (1. Institute of Information Engineering, Chinese Academy of Sciences. 2.
School of Cyber Security, University of Chinese Academy of Sciences); Peng
Chen (1. Institute of Information Engineering, Chinese Academy of Sciences. 2.
School of Cyber Security, University of Chinese Academy of Sciences); Tao
Liang (1. Institute of Information Engineering, Chinese Academy of Sciences.
2. School of Cyber Security, University of Chinese Academy of Sciences);
Zhaoxing Li (Institute of Information Engineering, Chinese Academy of
Sciences); Cai Yu (1. Institute of Information Engineering, Chinese Academy of
Sciences. 2. School of Cyber Security, University of Chinese Academy of
Sciences); Shuqiao Zou (1. Institute of Information Engineering, Chinese
Academy of Sciences. 2. School of Cyber Security, University of Chinese
Academy of Sciences); Jiao Dai (Institute of Information Engineering, Chinese
Academy of Sciences)*; Jizhong Han (Institute of Information
Engineering, Chinese Academy of Sciences) |
|
|
|
O16 |
Object/Person
detection, Tracking and Recognition I |
Time |
|
Chair |
Chunjie
Zhang (Beijing Jiaotong University) |
ID |
Title |
Author |
72 |
PMAE:
PSEUDO MULTI-LABEL ATTENTION ENSEMBLE |
Xueman
Wang (Tiangong University); Ling Du (Tiangong University)*; Junbing Li
(Tianjin University) |
102 |
Improving
Facial Attribute Recognition by Group and Graph Learning |
Zhenghao
Chen (University of Sydney)*; Shuhang Gu (ETH Zurich, Switzerland); Feng Zhu
(Sensetime Group Limited); Jing Xu (Sensetime Group Limited); Rui Zhao
(Sensetime Group Limited) |
144 |
DSIC:
Dynamic Sample-Individualized Connector for Multi-scale Object Detection |
Zekun
Li (Institute of Automation, Chinese Academy of Sciences); Yufan Liu
(Institute of Automation, Chinese Academy of Sciences); Bing Li (National
Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese
Academy of Sciences)*; Weiming Hu (Institute of Automation, Chinese Academy of
Sciences); Yanan Miao (CNCERT); Hong Zhang (CNCERT) |
368 |
Object
Decoupling with Graph Correlation for Fine-Grained Image Classification |
Qiushi
Guo (Alibaba Group)*; Mingchen Zhuge (China University of Geosciences);
Dehong Gao (Alibaba Group); Huiling Zhou (Alibaba); Xin Wang (Alibaba Group);
Xiaonan Meng (Alibaba Group) |
428 |
Exploring
Driving-aware Salient Object Detection via Knowledge Transfer |
Jinming
Su (Beihang University); Changqun Xia (Peng Cheng Laboratory)*; Jia Li
(Beihang University) |
489 |
Hands-on
Guidance for Distilling Object Detectors |
Yangyang
Qin (Huazhong University of Science and Technology)*; Hefei Ling (Huazhong
University of Science and Technology); Zhenghai He (Huazhong University of
Science and Technology); Yuxuan Shi (Huazhong University of Science and
Technology); Lei Wu (Huazhong University of Science and Technology) |
|
|
|
O17 |
Emerging
multimedia applications of deep learning I |
Time |
|
Chair |
Wei
Qi Yan (Auckland University of Technology) |
ID |
Title |
Author |
169 |
Enhancing
Adversarial Examples Via Self-Augmentation |
Lifeng
Huang (Sun Yat-sen University)*; Chengying Gao (Sun Yat-sen University);
Wenzi Zhuang (Sun Yat-sen University); Ning Liu (Sun Yat-sen University) |
178 |
Unsupervised
ensemble learning via network generation |
Zhongfan
Zhang (South China University of Technology); Wenming CAO (The University of
Hong Kong)*; Cheng Liu (Shantou University); Rui Li (City University of Hong
Kong); Qianfen Jiao (City University of Hong Kong); Zhiwen Yu (South China
University of Technology); C. L. Philip Chen (South China University of
Technology); Hau San Wong (City University of Hong Kong) |
335 |
Learning
to transfer under unknown noisy environments: an universal weakly-supervised
domain adaptation method |
Xuan
Liu (Hunan University); Ying Huang (Hunan University)*; Shichang He (Hunan
University); Jiangjin Yin (Hunan University); Xinning Chen (Hunan
University); Shigeng Zhang (Central South University) |
651 |
Efficient
training of lightweight neural networks using Online Self-Acquired Knowledge
Distillation |
Maria
Tzelepi (Aristotle University of Thessaloniki)*; ANASTASIOS TEFAS (Aristotle
University of Thessaloniki) |
763 |
Flexible
Knowledge Distillation with an Evolutional Network Population |
Jie
Lei (Zhejiang University of Technology); Zhao Liu (Ping An Life Insurance of
China, Ltd.)*; Mingli Song (Zhejiang University); Juan Xu (Ping An Life
Insurance of China); Jianping Shen (Ping An Life Insurance of China); Ronghua
Liang (Zhejiang University of Technology) |
838 |
Cooperative
Learning for Noisy Supervision |
Hao
Wu (Cooperative Medianet Innovation Center, Shanghai Jiao Tong University)*;
Jiangchao Yao (Damo Academy, Alibaba Group); Ya Zhang (Cooperative Medianet
Innovation Center, Shanghai Jiao Tong University); Yan-Feng Wang
(Cooperative Medianet Innovation Center, Shanghai Jiao Tong University) |
|
|
|
O18 |
Multimedia security, privacy and
forensic I |
Time |
|
Chair |
Jun
Wan (NLPR, CASIA) |
ID |
Title |
Author |
645 |
Multi-task
Wavelet Corrected Network For Image Splicing Forgery Detection and
Localization |
Xiuli
Bi (Chongqing University of Posts and Telecommunications); Zhang Zhipeng
(Chongqing University of Posts and Telecommunications); Liu Yanbin (Chongqing
University of Posts and Telecommunications); Bin Xiao (Chongqing University
of Posts and Telecommunications)*; Weisheng Li (Chongqing University of Posts and
Telecommunications) |
1454 |
Multi-Modality
Image Manipulation Detection |
Chao
Yang (Hunan University)*; Zhiyu Wang (Hunan University); Huawei Shen
(Institute of Computing Technology, Chinese Academy of Sciences); Huizhou Li
(Hunan University); Bin Jiang (Hunan University) |
200 |
Video
Abnormal Event Detection via Context Cueing Generative Adversarial Network |
Zhi
Zhang (Shenzhen University); Sheng-hua Zhong (Shenzhen University)*; Yan Liu
(The Hong Kong Polytechnic University) |
247 |
Leveraging
Intra-domain Knowledge to Strengthen Cross-domain Crowd Counting |
Yiqing
Cai (East China Normal University); Lianggangxu Chen (East China Normal
University); Zhenwei Ma (The Third Research Institute of Ministry of Public
Security); Changhong Lu (East China Normal University); Changbo Wang (East
China Normal University); Gaoqi He (East China Normal University)* |
282 |
DISCRIMINATIVE
AND GEOMETRICALLY ROBUST ZERO-WATERMARKING SCHEME FOR PROTECTING DIBR 3D
VIDEOS |
Xiyao
Liu (Central South University); Yayun Zhang (Central South University); Sibo
Du (Central South University); Jian Zhang (Central South University)*;
Ming Jiang (Guilin University of
Electronic Technology); Hui Fang (Loughborough University) |
1037 |
H-StegoNet:
A Hybrid Deep Learning Framework for Robust Steganalysis |
Soumik
Mondal (A*STAR)*; Yeo Sze
Ling (ASTAR-Institute for
Infocomm Research, A*STAR); ArulMurugan Ambikapathi (ASTAR-Institute for
Infocomm Research, A*STAR) |
|
|
|
O19 |
Special
Session: Advanced Representation Learning for Robust Multimedia Image
Understanding |
Time |
|
Chair |
Guangwei
Gao (Nanjing University of Posts and Telecommunications) |
ID |
Title |
Author |
383 |
Learning
Homogeneous and Heterogeneous Co-Occurrences for Unsupervised Cross-modal
Retrieval |
Yang
Zhao (Nanjing University of Science and Technology); Weiwei Wang (Nanjing
University of Science and Technology); Haofeng Zhang (Nanjing University of
Science and Technology)*; BingZhang Hu (Newcastle University) |
643 |
Multimodal
Transformer Networks with Latent Interaction for Audio-Visual Event
Localization |
Yixuan
He (University of Electronic Science and Technology of China); Xing Xu
(University of Electronic Science and Technology of China)*; Xin Liu (Huaqiao
University); Weihua Ou (Guizhou Normal University); Huimin Lu (Kyushu
Institute of Technology) |
921 |
Disentangling
Prototype and Variation for Single Sample Face Recognition |
MENG
PANG (Nanyang Technological University); Binghui Wang (Duke University); Mang
YE (Wuhan University); Yiran Chen (Duke University); Bihan Wen (Nanyang
Technological University)* |
1178 |
Transferable
Feature Learning on Graphs Across Visual Domains |
Ronghang
Zhu (University of Georgia)*; Xiaodong Jiang (Facebook Inc); Jiasen Lu (Allen
Institute for AI); Sheng Li (University of Georgia) |
1452 |
Face
Super-Resolution through Dual-identity Constraint |
Fangfang
Cheng (Wuhan Institute of Technology)*; Tao Lu (Wuhan Institute of
Technology); Yu Wang (Wuhan Institute of Technology); Yanduo Zhang (Wuhan
Institute of Technology) |
|
|
|
|
|
|
O20 |
Multimedia
Applications I |
Time |
|
Chair |
Yongshan
Zhang (University of Macau) |
ID |
Title |
Author |
908 |
DGD-NET:
LOCAL DESCRIPTOR GUIDED KEYPOINT DETECTION NETWORK |
Xiaotao
Liu (Tianjin University); Chen Meng (College of Intelligence and Computing,
Tianjin University, China); Fei-Peng
Tian (Tianjin University); Wei Feng (College of Intelligence and
Computing, Tianjin University, China)* |
1335 |
Multi-view
Tensor Clustering through Exploiting both Within-view and Across-view
High-order Correlations |
Haiyan
Wang (South China University of Technology); Guoqiang Han (South China
University of Technology); Yu Hu (South China University of Technology); Hong
Peng (South China University of Technology); Jiazhou Chen (South China
University of Technology); Bin Zhang (South China University of Technology);
Hongmin Cai (South China University of Technology)* |
1484 |
Path
Ranking Model For Entity Prediction |
Xiao
Long (USTC); MingHong Yao (University of Science and Technology of China);
Liansheng Zhuang (University of Science and Technology of China)*; Houqiang
Li (University of Science and Technology of China) |
1057 |
Learning
efficient rotation representation for point cloud via local-global
aggregation |
Ruibin
Gu (South China University of Technology); Qiuxia Wu (South China University
of Technology, China)*; Hongbin Xu (South China University of Technology);
Wing W.Y. Ng (South China University of Technology); Zhiyong Wang (The
University of Sydney) |
371 |
Model
Compression via Collaborative Data-free Knowledge Distillation for Edge
Intelligence |
Zhiwei
Hao (Beijing Institute of Technology)*; Yong Luo (Wuhan University); Zhi Wang
(Tsinghua University); Han Hu (Beijing Institute of Technology, China);
Jianping An (Beijing Institute of Technology) |
|
|
|
|
|
|
O21 |
Object/Person
detection, Tracking and Recognition II |
Time |
|
|
Chair |
Yu
Zhou (Institute of Information Engineering, CAS) |
ID |
Title |
Author |
547 |
Multi-view
Face Recognition using Deep Attention-based Face Frontalization |
Xiao-Hu
Shao (Chongqing Institute of Green and Intelligent Technology, Chinese Academy
of Sciences; University of Chinese Academy of Sciences)*; Junliang Xing
(Institute of Automation, Chinese Academy of Sciences); Ruihan Pan (Chongqing
Institute of Green and Intelligent Technology, Chinese Academy of Sciences);
Zhenghao Li (Chongqing Institute of Green and Intelligent Technology, Chinese
Academy of Sciences); Xiang-Dong Zhou (Chongqing Institute of Green and
Intelligent Technology, Chinese Academy of Sciences); Yu Shi (Chongqing
Institute of Green and Intelligent Technology, Chinese Academy of Sciences) |
899 |
CORE-Text:
Improving Scene Text Detection with Contrastive Relational Reasoning |
Jingyang
Lin (Sun Yat-sen University); Yingwei Pan (JD AI Research)*; Rongfeng Lai (JD
AI Research); Xuehang Yang (JD AI Research); Hongyang Chao (Sun Yat-sen
University); Ting Yao (JD AI Research) |
923 |
SSDL:
Self-Supervised Dictionary Learning |
Shuai
Shao (China University of Petroleum (East China) College of Control Science
and Engineering); Lei Xing (China University of Petroleum (East China) College
of Oceanography and Space Informatics); Wei Yu (Harbin Institute of
Technology, School of Computer Science and Technology); Rui Xu (China
University of Petroleum (East China) College of Control Science and
Engineering); Yanjiang Wang (China University of Petroleum (East China)
College of Control Science and
Engineering); Baodi Liu (China University of Petroleum (East China) College
of Information and Control Engineering)* |
974 |
DeepMix:
Online Auto Data Augmentation for Robust Visual Object Tracking |
Ziyi
Cheng (Kyushu University); Xuhong Ren (School of Computer Science and
Engineering, Tianjin University of Technology); Felix Juefei-Xu (Alibaba
Group, USA); Wanli Xue (Tianjin University of Technology)*; Qing Guo (Nanyang
Technological University); Lei Ma (University of Alberta); Jianjun Zhao
(Kyushu University) |
1006 |
MATTING
ENHANCED MASK R-CNN |
Lufan
Ma (Tsinghua University)*; Bin Dong (Southeast University); Jiangpeng Yan
(Tsinghua University); Xiu Li (Tsinghua University) |
1036 |
DEEP
CORRELATION FILTERS FOR ROBUST VISUAL TRACKING |
Xiang
Liu (Dongguan University of Technology)* |
|
|
|
O22 |
Image/Video Synthesis and Creation II |
Time |
|
Chair |
Ming-Ching
Chang (University at Albany - SUNY) |
ID |
Title |
Author |
403 |
STAE:
A SpatioTemporal Auto-Encoder for High-Resolution Video Prediction |
Zheng
Chang (University of Chinese Academy of Sciences)*; Xinfeng Zhang
(University of Chinese Academy of Sciences); Shanshe Wang (Peking
University); Siwei Ma (Peking University, China); Yan Ye (Alibaba Inc.); Wen
Gao (PKU) |
439 |
FEW-SHOT
KNOWLEDGE TRANSFER FOR FINE-GRAINED CARTOON FACE GENERATION |
Nan
Zhuang (Peking University)*; Cheng Yang (ByteDance Inc.) |
817 |
BargainNet:
Background-Guided Domain Translation for Image Harmonization |
Wenyan
Cong (Shanghai Jiao Tong University); Li Niu (Shanghai Jiao Tong
University)*; Jianfu Zhang (RIKEN AIP; Shanghai Jiao Tong University); Jing
Liang (Shanghai Jiao Tong University); Liqing Zhang (Shanghai Jiao Tong
University) |
1160 |
DNA-NET:
AGE AND GENDER AWARE KIN FACE SYNTHESIZER |
Pengyu
Gao (Southeast University); Joseph P Robinson (Northeastern University);
Jiaxuan Zhu (Southeast University); Chao Xia (Shanghai Jiao Tong University);
Ming Shao (University of Massachusetts Dartmouth); Siyu Xia (Southeast
University, China)* |
1163 |
Spatial
Content Alignment For Pose Transfer |
Wing
Yin Yu (City University of Hong Kong)*; Lai-Man Po (City University of Hong
Kong); Yuzhi Zhao (City University of Hong Kong); Jingjing Xiong (City
University of Hong Kong); Kin Wai Lau (City University of Hong Kong) |
1339 |
INFRARED
AND VISIBLE IMAGE FUSION BASED ON MODAL FEATURE FUSION NETWORK AND DUAL
VISUAL DECISION |
Yong
Yang (School of Information Technology, Jiangxi University of Finance and
Economics); Jiaxiang Liu (School of Information Technology, Jiangxi
University of Finance and Economics)*; Shuying Huang (School of Software and
Communication Engineering, Jiangxi University of Finance and Economics);
Weiguo Wan (School of Software and Communication Engineering, Jiangxi
University of Finance and Economics); Xiangkai Kong (School of Information
Technology, Jiangxi University of Finance and Economics); Wang Zhang (School
of Information Technology, Jiangxi University of Finance and Economics) |
|
|
|
O23 |
Multimedia analysis and understanding I |
Time |
|
Chair |
Bingpeng
Ma (University of Chinese Academy of Sciences) |
ID |
Title |
Author |
937 |
Cross-scene
Person Trajectory Anomaly Detection Based on Re-Identification |
Yuanxun
Li (Sun Yat-sen University, China); Ancong Wu (Sun Yat-sen University);
WEI-SHI ZHENG (Sun Yat-sen University, China)* |
1073 |
ACTION
PREDICTION NETWORK WITH AUXILIARY OBSERVATION RATIO REGRESSION |
Cuiwei
Liu (Shenyang Aerospace University)*; Yiming Gao (Shenyang Aerospace
University); Zhaokui Li (Shenyang Aerospace University); Chong Du (Shenyang
Aircraft Design and Research Institute); Fang Liu (Shenyang Aerospace
University; Northeastern University); Xiangbin Shi (Shenyang Aerospace
University) |
1082 |
GAIT
IDENTIFICATION BASED ON HUMAN SKELETON WITH PAIRWISE GRAPH CONVOLUTIONAL
NETWORK |
Ke
Xu (Shanghai Jiao Tong University)*; Xinghao Jiang (Shanghai Jiao Tong
University); Tanfeng Sun (Shanghai Jiao Tong University) |
1119 |
spatial
reasoning and context-aware attention network for skeleton-based action
recognition |
Dianlong
You (Yanshan University); Ling Wang (Yanshan University)*; Da Han (Cardiff
University); Shunpan Liang (Yanshan University); Hongyang Liu (Yanshan
University); Fuyong Yuan (Yanshan University) |
1525 |
Edge
Enhancement Network for Weakly Supervised Semantic Segmentation |
Mei
Yu (Tianjin University); Junbin Wei (Tianjin University); Chenhan Wang
(Laboratory of OpenBayes Machine Intelligence Lab); Han Jiang (Laboratory of
OpenBayes Machine Intelligence Lab); Jian Yu (Tianjin University); Ruixuan
Zhang (College of Intelligence and Computing, Tianjin University); Xuewei Li
(Tianjin University)*; Ruiguo Yu (Tianjin University) |
1587 |
Associative
Segmentation for Instances and Semantics by perceiving neighborhood in Point
Clouds |
Yingying
Zhu (Shenzhen University); Biao Li (Shenzhen University); Qiang Huang
(Shenzhen University)* |
|
|
|
O24 |
Multimedia
interaction & Multimedia quality assessment |
Time |
|
Chair |
Jong-Seok
LEE (Yonsei University) |
ID |
Title |
Author |
959 |
FINE-GRAINED
DISCOURSE FOR METAPHOR DETECTION |
Qimeng
Yang (Xinjiang University)*; Long Yu (Xinjiang University); Shengwei Tian
(Xinjiang University); Jinmiao Song (Xinjiang University) |
1128 |
Facial
Chirality: Using self-face reflection to learn discriminative features for
facial expression recognition |
Ling
Lo (National Chiao Tung University); Hong Xia Xie (National Chiao Tung
University); Hong-Han Shuai (National Chiao Tung University); Wen-Huang Cheng
(National Chiao Tung University)* |
106 |
SKANET:
STRUCTURED KNOWLEDGE-AWARE NETWORK FOR VISUAL DIALOG |
Lei
Zhao (The University of Electronic Science and Technology of China); Lianli
Gao (The University of Electronic Science and Technology of China)*;
Yuyu Guo (UESTC); Jingkuan Song (UESTC); Heng Tao Shen (University of
Electronic Science and Technology of China (UESTC)) |
755 |
A
No-reference Evaluation Metric for Low-light Image Enhancement |
Zicheng
Zhang (Shanghai Jiao Tong University)*; Wei Sun (Shanghai Jiao Tong
University); Xiongkuo Min (Shanghai Jiao Tong University); Wenhan Zhu
(Shanghai Jiao Tong University); Tao Wang (Shanghai Jiao Tong University); Wei
Lu (Shanghai Jiao Tong University); Guangtao Zhai (Shanghai Jiao Tong
University) |
1158 |
DEEP
NEURAL NETWORKS FOR END-TO-END SPATIOTEMPORAL VIDEO QUALITY PREDICTION AND
AGGREGATION |
Junming
Chen (Peking University); Haiqiang Wang (Pengcheng Laboratory); Munan Xu
(Shenzhen Graduate School, Peking University); Ge Li (SECE, Shenzhen Graduate
School, Peking University)*; Shan Liu (Tencent America) |
1465 |
No-Reference
Deep Quality Assessment of Compressed Light Field Images |
Zixuan
Guo (Peking University); Wei Gao (Peking University & Peng Cheng
Laboratory)*; Haiqiang Wang (Pengcheng Laboratory); Junle Wang (Tencent);
Songlin Fan (Peking University) |
|
|
|
O25 |
Multimedia security, privacy and
forensic II |
Time |
|
Chair |
Liang
He (Tsinghua University) |
ID |
Title |
Author |
18 |
Blind
Adversarial Pruning: Towards the Comprehensive Robust Models with Gradually
Pruning Against Blind Adversarial Attacks |
Haidong
Xie (Qian Xuesen Laboratory, China Academy of Space Technology); Lixin Qian
(Wuhan University of Technology); Xueshuang Xiang (Qian Xuesen Laboratory of
Space Technology)*; Naijin Liu (Qian Xuesen Laboratory, China Academy of
Space Technology) |
695 |
EFFICIENT
OPEN-SET ADVERSARIAL ATTACKS ON DEEP FACE RECOGNITION |
Haojie
Yuan (University of Science and Technology of China); Qi Chu (University of
Science and Technology of China)*; Feng Zhu (University of Science and
Technology of China); Rui Zhao (SenseTime Group Limited); Bin Liu (University
of Science and Technology of China); Nenghai Yu (University of Science and
Technology of China) |
920 |
CONTENT-INDEPENDENT
ONLINE HANDWRITING VERIFICATION BASED ON MULTI-MODAL FUSION |
Nan
Ji (School of Cyberspace Security, University of Science and Technology of
China); Bin Liu (University of Science and Technology of China)*; Zhiwei Zhao
(University of Science and Technology of China); Yan Lu (University of
Sydney); Qi Chu (University of Science and Technology of China); Zhenchao Jin
(University of Science and Technology of China); Nenghai Yu (University of
Science and Technology of China) |
1324 |
On
Generating JPEG Adversarial Images |
Mengte
Shi (Fudan University); Sheng Li (Fudan University); Zhaoxia Yin (Anhui
University); Xinpeng Zhang (School of Computer Science, Fudan University)*;
Zhenxing Qian (School of Computer Science, Fudan University) |
1459 |
Transferable
Adversarial Examples for Anchor Free Object Detection |
Quanyu
Liao (Chengdu University of Information Technology); Xin Wang (Keya Medical);
Bin Kong (Curacloud); Siwei Lyu (University at Buffalo); Bin Zhu (Microsoft
Research Asia); Youbing Yin (Curacloud); Qi Song (Curacloud); Xi Wu (Chengdu
University of Information Technology)* |
|
|
|
|
|
|
O26 |
Special
Session: Recent Advance in Depth-Related Processing and Applications |
Time |
|
Chair |
Runmin
Cong (Beijing Jiaotong University) |
ID |
Title |
Author |
105 |
SN-Graph:
a Minimalist 3D Object Representation for Classification |
Siyu
Zhang (Donghua University); Hui Cao (Donghua University); Yuqi Liu (Donghua
University); Shen Cai (Donghua University)*; Yanting Zhang (Donghua
University); Yuanzhan Li (Donghua University); Xiaoyu Chi (Goertek Co., Ltd) |
259 |
Stereo
Superpixel Segmentation via Dual-attention Fusion Networks |
Ruiqi
Wu (Wuhan University of Technology); Yajuan Du (Wuhan University of
Technology); Hua Li (Huazhong University of Science and Technology; City
University of Hong Kong)*; Yucong Dai (Wuhan University of Technology) |
278 |
IRS:
A Large Naturalistic Indoor Robotics Stereo Dataset to Train Deep Models for
Disparity and Surface Normal Estimation |
Qiang
Wang (Hong Kong Baptist University)*; Shizhen Zheng (HKBU); Qingsong Yan
(Wuhan University); Fei Deng (Wuhan University); Kaiyong Zhao (Hong Kong
Baptist University); Xiaowen Chu (Hong Kong Baptist University) |
1162 |
QoE-based
Neural Live Streaming Method With Continuous Dynamic Adaptive Video Quality
Control |
Xuekai
WEI (City University of Hong Kong); Mingliang Zhou (Chongqing University)*;
Sam Kwong (City University of Hong Kong); Hui Yuan (Shandong University); Tao
Xiang (Chongqing University) |
1345 |
DUAL
REGULARIZATION BASED DEPTH MAP SUPER-RESOLUTION WITH GRAPH LAPLACIAN PRIOR |
Longhua
Sun (Beijing University of Technology); Jin Wang (Beijing University of
Technology)*; Ruiqin Xiong (Peking University); Yunhui Shi (Beijing
University of Technology); Qing Zhu (Beijing University of Technology);
Baocai Yin (Beijing University of
Technology) |
|
|
|
|
|
|
O27 |
Image/Video
Enhancement III |
Time |
|
Chair |
Chau-Wai
Wong (North Carolina State University) |
ID |
Title |
Author |
1174 |
Image
demoireing with a dual-domain distilling network |
Hailing
Wang (Tianjin University); Qiaoyu Tian (Tianjin University); Liang Li
(Tianjin University)*; Xiaojie Guo (Tianjin University) |
1176 |
Contrastive
Feature Decomposition for Image Reflection Removal |
Xin
Feng (Harbin Institute of Technology, Shenzhen); Haobo Ji (Harbin Institute
of Technology, Shenzhen); Bo Jiang (Harbin Institute of Technology, Shenzhen);
Wenjie Pei (Harbin Institute of Technology, Shenzhen); Fanglin Chen (Harbin
Institute of Technology, Shenzhen); Guangming Lu (Harbin Institute of
Technology, Shenzhen)* |
1189 |
RGB
GUIDED DEPTH MAP SUPER-RESOLUTION WITH COUPLED U-NET |
Yingjie
Cui (Tsinghua University); Qingmin Liao (Tsinghua University)*; Wenming Yang
(Tsinghua University); Jing-Hao Xue (University College London) |
1374 |
Blur
Invariant Kernel-Adaptive Network for Single Image Blind Deblurring |
Sungkwon
An (Seoul National University); Hyungmin Roh (Seoul National University);
Myungjoo Kang (Seoul National University)* |
1373 |
STRUCTURAL
PRIOR GUIDED IMAGE INPAINTING FOR COMPLEX SCENE |
Shuxin
Wei (Sun Yat-sen University); Chengying Gao (Sun Yat-sen University)* |
1517 |
BWIN:
A Bilateral Warping Method for Video Frame Interpolation |
Fanyong
Xue (Shanghai Jiao Tong University); Jie Li (Shanghai Jiao Tong University)*;
Jiannan Liu (Shanghai Jiao Tong University); Chentao Wu (Shanghai Jiao Tong
University) |
|
|
|
O28 |
Multimedia analysis and understanding
II |
Time |
|
Chair |
Liping
Chen (Microsoft) |
ID |
Title |
Author |
953 |
A
lightweight Saliency Prediction Model for Omnidirectional Images |
Dandan
Zhu (Shanghai Jiao Tong University)*; Yongqing Chen (Hainan Air Traffic
Management Sub-Bureau); Defang Zhao (Tongji University); Xiongkuo Min
(Shanghai Jiao Tong University); Qiangqiang Zhou (Jiangxi Normal University);
Shaobo Yu (East China Normal University); Guangtao Zhai (Shanghai Jiao Tong
University); Xiaokang Yang (Shanghai Jiao Tong University) |
1277 |
Multi-Scale
Attention Constraint Network for Fine-Grained Visual Classification |
Yaqing
Hou (Dalian University of Technology)*; Zhang Wenkai (Dalian University of
Technology); Dongsheng Zhou (dlu.edu.cn); Hongwei Ge (Dalian University of
Technology); Qiang Zhang (Dalian University of Technology); Xiaopeng Wei
(Dalian University of Technology) |
1314 |
Multiple
Hub-driven Attention Graph Network for Scene Graph Generation |
Yang
Yao (Sun Yat-sen University)*; Bo Gu (Sun Yat-sen University) |
1105 |
HRDNet:
High-resolution Detection Network for Small Objects |
Ziming
Liu (Inria); Guangyu Ryan Gao (Beijing Institute of Technology)*; Lin Sun
(Samsung, USA); Zhiyuan Fang (Beijing Institute of Technology) |
1157 |
Meta-Graph
Adaptation for Visual Object Tracking |
Qiangqiang
Wu (City University of Hong Kong); Antoni Chan (City University of Hong Kong,
Hong Kong)* |
1288 |
CUTMIX
DUAL BRANCH NETWORK FOR PERSON RE-IDENTIFICATION |
Zengming
Tang (Shanghai Advanced Research Institute, Chinese Academy of Sciences,
Shanghai, China)*; Jun Huang (Shanghai Advanced Research Institute, Chinese
Academy of Sciences) |
|
|
|
O29 |
Emerging
multimedia applications of deep learning II |
Time |
|
|
Chair |
Maggie
Zhu (Purdue University) |
ID |
Title |
Author |
895 |
DEEP
TIERED IMAGE SEGMENTATION FOR DETECTING INTERNAL ICE LAYERS IN RADAR IMAGERY |
Yuchen
Wang (Indiana University)*; Mingze Xu (Amazon); John Paden (University of
Kansas); Lora Koenig (University of Colorado); Geoffrey Charles Fox (Indiana University);
David Crandall (Indiana University) |
947 |
ATTENTION
DRIVEN SELF-SIMILARITY CAPTURE FOR MOTION DEBLURRING |
Jie
Zhang (School of Computer Science, Fudan University); Chuanfa Zhang (Fudan
University); Jiangzhou Wang (School of Computer Science, Fudan University);
Qingyue Xiong (Fudan University); Yingtao Zhang (School of Computer Science,
Fudan University); Wenqiang Zhang (Fudan University)* |
986 |
Wide-sense
Stationary Policy Optimization with Bellman Residual on Video Games |
Chen
Gong (Institute of Automation, Chinese Academy of Sciences)*; Qiang He
(Institute of Automation, Chinese Academy of Sciences); Yunpeng Bai
(Institute of Automation, Chinese Academy of Sciences); Xinwen Hou (Institute
of Automation, Chinese Academy of Sciences); Guoliang Fan (Institute of
Automation, Chinese Academy of Sciences); Yu Liu (Institute of Automation,
Chinese Academy of Sciences) |
1076 |
ACSNet:
Adaptive Cross-scale Network with Feature Maps Refusion for Vehicle Density
Detection |
Zuhao
Ge (Shantou University); Yuhui Li (Shantou University); Cheng Liang (Shantou
University); Youyi Song (The Hong Kong Polytechnic University); Teng Zhou
(Shantou University)*; Jing Qin (The Hong Kong Polytechnic University) |
1150 |
Unsupervised
Domain Adaptation via Cluster Alignment with Maximum Classifier Discrepancy |
Mohamed
Azzam (City University of Hong Kong); Si Wu (South China University of
Technology); Aurele Tohokantche Gnanha
(City University of Hong Kong); Qianfen Jiao (City University of Hong
Kong); Hau San Wong (City University of Hong Kong)* |
1495 |
LIDAR-BASED
REAL-TIME MAPPING FOR DIGITAL TWIN DEVELOPMENT |
Evan
Brock (University of Tennessee at Chattanooga); Chengxuan Huang (University
of California, Davis); Dalei Wu (University of Tennessee at Chattanooga)*; Yu
Liang (University of Tennessee at Chattanooga) |
|
|
|
O30 |
Multimedia
Applications II |
Time |
|
Chair |
Xinggong
Zhang (Peking University) |
ID |
Title |
Author |
204 |
CoConv:
Learning Dynamic Cooperative Convolution for Image Recognition |
Kien
X Nguyen (Texas Christian University); Tiffany Ryu (University of North
Texas); Jocelyn Zhang (University of North Texas); Xu Ma (University of North
Texas)*; Qing Yang (University of North Texas); Song Fu (University of North
Texas); Paparao Palacharla (Fujitsu Network Communications); Nannan Wang
(Fujitsu Network Communications); Xi Wang (Fujitsu Network Communications) |
1192 |
DRL-based
Collaborative Edge Content Replication with Popularity Distillation |
Haopeng
Yan (Tsinghua University); Zeming Chen (Tsinghua University); Zhi Wang
(Tsinghua University); Wenwu Zhu (Tsinghua University)* |
656 |
Handwriting
Trajectory Recovery from Off-Line Multi-Stroke Characters by Deep Ordering
Prediction and Heuristic Search |
Tie-Qiang
Wang (CASIA)*; Cheng-Lin Liu (Institute of Automation of Chinese Academy of
Sciences) |
239 |
CNN-Based
Depth Map Prediction for Fast Block Partitioning in HEVC Intra Coding |
Aolin
Feng (University of Science and Technology of China); Changsheng Gao
(University of Science and Technology of China); Li Li (University of Science
and Technology of China); Dong Liu (University of Science and Technology of
China)*; Feng Wu (University of Science and Technology of China) |
1463 |
A
REAL-TIME H.266/VVC SOFTWARE DECODER |
Bin
Zhu (Tencent America); Shan Liu (Tencent America); Yuan Liu (Tencent
America); Yi Luo (Tencent America); Jing Ye (Tencent America); Haiyan Xu
(Tencent America); Ying Huang (Tencent America); Hualong Jiao (Tencent
America); Xiaozhong Xu (Tencent America)*; Xianguo Zhang (Tencent); Chenchen
Gu (Tencent) |
16 |
On
Forecasting Dynamics in Online Discussion Forums |
Chen
Ling (University of Delaware); Di Cui (University of Delaware); Guangmo Tong
(University of Delaware)*; Jianming ZHU (University of Chinese Academy of
Sciences) |
|
|
|
O31 |
Special
Session: Multimedia Knowledge-Driven Deep Analysis and Forensics/Security |
Time |
|
Chair |
Chang-Tsun
Li (Deakin University) |
ID |
Title |
Author |
209 |
Robust
Image Denoising with Texture-Aware Neural Network |
Bo
Fu (Dalian University of Technology)*; Liyan Wang (Liaoning Normal
University); Zhongxuan Luo (DALIAN UNIVERSITY OF TECHNOLOGY) |
555 |
Multi-Graph
Based Hierarchical Semantic Fusion for Cross-Modal Representation |
Lei
Zhu ()*; Chengyuan Zhang (Hunan University); Jiayu Song (Central South
University); Liangcheng Liu (University of Melbourne); Shichao Zhang ();
Yangding Li (Hunan Normal University) |
96 |
ON
CONSTRUCTING A BETTER CORRELATION PREDICTOR FOR PRNU-BASED IMAGE FORGERY
LOCALIZATION |
Xufeng
Lin (Deakin University)*; Chang-Tsun Li (Deakin University, Australia) |
337 |
VideoForensicsHQ:
Detecting High-quality Manipulated Face Videos |
Gereon
Fox (Max Planck Institute for Informatics)*; Wentao Liu (Max Planck Institute
for Informatics); Hyeongwoo Kim (Max Planck Institute for Informatics);
Hans-Peter Seidel (Max Planck Institute for Informatics); Mohamed Elgharib
(Max Planck Institute for Informatics); Christian Theobalt (MPI Informatik) |
1185 |
DEFAKEHOP:
A LIGHT-WEIGHT HIGH-PERFORMANCE DEEPFAKE DETECTOR |
Hong-Shuo
Chen (USC)*; Mozhdeh Rouhsedaghat (University of Southern California); Hamza
H Ghani (USC); Shuowen Hu (US Army Research Laboratory); Suya You (U.S. Army
Research Laboratory); C.-C. Jay Kuo (USC) |
|
|
|
|
|
|
O32 |
Industry
and Application Track I |
Time |
|
Chair |
Zhang
Wei (Singapore Institute of Technology) |
ID |
Title |
Author |
W11 |
MT-GAN:
A Training Framework to Enhance Image Classification Task with Image
Translation |
Qun
Li (Microsoft)*; Changbo Hu (Microsoft); Keng-hao Chang (Microsoft); Ruofei
Zhang (Microsoft) |
W90 |
A
time-variant QoE model based on real video streaming data |
Shengbin
Meng (ByteDance Inc.)*; Minyin Zeng (ByteDance Inc.); Junlin Li (ByteDance
Inc.); Yue Wang (Beijing ByteDance Technology Co., Ltd.); Zongming Guo
(Peking University) |
W111 |
Hardware-aware
Model Optimization Tool For Embedded Devices |
Cagri
Ozcinar (Samsung)*; Dongsun Kim (Samsung R&D Institute UK); Ben Rufus
Duckworth (Samsung R&D Institute UK); Shayan Joya (Samsung R&D
Institute UK); Nicolas Scotto Di
Perto (Samsung R&D Institute UK); Attila Dusnoki (University of Szeged);
Márkó Fabó (University of Szeged); Dániel Vince (University of Szeged);
Gábor Lóki (University of Szeged); Ákos Kiss (University of Szeged);
Christopher Alder (Samsung R&D Institute UK) |
W117 |
MULTI-MODAL
FUSION ENHANCED MODEL FOR DRIVER’S FACIAL EXPRESSION RECOGNITION |
Jianrong
Chen (University of California, San Diego)*; Sujit Dey (University of
California, San Diego); Lei Wang (Qualcomm); Ning Bi (Qualcomm); Peng Liu
(Qualcomm) |
|
|
|
|
|
|
|
|
|
O33 |
Image/video
acquisition and compression |
Time |
|
Chair |
Xin
Zhao (Tencent) |
ID |
Title |
Author |
698 |
Thousand
to One: Semantic Prior Modeling for Conceptual Coding |
Jianhui
Chang (Peking University)*; Zhenghui Zhao (Peking University); Lingbo Yang
(Peking University); Chuanmin Jia (Peking University); Jian Zhang (Peking
University Shenzhen Graduate School); Siwei Ma (Peking University, China) |
1063 |
Spatial-Temporal
Synergic Prior Driven Unfolding Network for Snapshot Compressive Imaging |
Zhuoyuan
Wu (PKU)*; Zhenyu Zhang (PKU); Jiechong Song (PKU); Jian Zhang (Peking
University Shenzhen Graduate School) |
1068 |
EFFICIENT
VIDEO COMPRESSED SENSING RECONSTRUCTION VIA EXPLOITING SPATIAL-TEMPORAL
CORRELATION WITH MEASUREMENT CONSTRAINT |
Zhichao
Wei (South China University of Technology)*; Chunling Yang (South China
University of Technology); Yunyi Xuan (South China University of Technology) |
1369 |
Enhanced
Implicit Selection of Transform Skip in AVS3 |
Liqiang
Wang (Tencent)*; Xiaozhong Xu (Tencent America); Shan Liu (Tencent America) |
205 |
VANet:
A View Attention Guided Network for 3D Reconstruction from Single and
Multi-view Images |
Yi
Yuan (NetEase Fuxi AI Lab)*; Jilin Tang (NetEase Fuxi AI Lab); Zhengxia Zou
(University of Michigan) |
226 |
DIFFERENTIABLE
LIGHT-WEIGHT ARCHITECTURE SEARCH |
Yuxu
Mao (Ocean University of China); Guoqiang Zhong (Ocean University of China)*;
Yanan Wang (Ocean University of China); Zhaoyang Deng (Ocean University of
China) |
|
|
|
O34 |
Multimedia
analysis and understanding III |
Time |
|
Chair |
Ming-Ching
Chang (University at Albany - SUNY) |
ID |
Title |
Author |
1399 |
MPN:
Multimodal Parallel Network for Audio-Visual Event Localization |
Jiashuo
Yu (Fudan University)*; Ying Cheng (Fudan University); Rui Feng (Fudan
University) |
1488 |
Learning
Content and Context with Language Bias for Visual Question Answering |
Chao
Yang (Hunan University)*; Su Feng (Hunan University); Dongsheng Li (Microsoft
Research Asia); Huawei Shen (Institute of Computing Technology, Chinese
Academy of Sciences); Guoqing Wang (Hunan University); Bin Jiang (Hunan
University) |
627 |
Efficient
Human Pose Estimation by Learning Deeply Aggregated Representations |
Zhengxiong
Luo (Institute of Automation, Chinese Academy of Sciences)*; Zhicheng Wang
(Megvii); Yuanhao Cai (Tsinghua University, Tsinghua Shenzhen International
Graduate School); Guan'an Wang (CASIA); Liang Wang (NLPR, China); Yan Huang
(Institute of Automation, Chinese Academy of Sciences); Erjin Zhou (Megvii
Research); Tieniu Tan (NLPR, China); Jian Sun (Megvii Technology) |
1180 |
An
Efficient Approach for Audio-Visual Emotion Recognition with Missing Labels
and Missing Modalities |
Fei
Ma (Tsinghua-Berkeley Shenzhen Institute, Tsinghua University)*; Shao-Lun
Huang (TBSI); Lin Zhang (Tsinghua University, China) |
1478 |
ConSK-GCN:
Conversational Semantic- and Knowledge-oriented Graph Convolutional Network for
Multimodal Emotion Recognition |
Yahui
Fu (Tianjin University)*; Shogo Okada (Japan Advanced Institute of Science
and Technology); Longbiao Wang (Tianjin University); Lili Guo (Tianjin
University); Yaodong Song (Tianjin University); Jiaxing Liu (Tianjin
University); Jianwu Dang (Tianjin University) |
|
|
|
|
|
|
O35 |
Special
Session: Advances in Language, Vision, and Limited Supervision |
Time |
|
Chair |
Yi
Cai (South China University of Technology) |
ID |
Title |
Author |
459 |
MNRE:
A Challenge Multimodal Dataset for Neural Relation Extraction with Visual
Evidence in Social Media Posts |
Changmeng
Zheng (South China University of Technology); Zhiwei Wu (School of Software
Engineering, South China University of Technology); Junhao Feng (South China
University of Technology); Ze Fu (School of Software Engineering, South China
University of Technology); Yi Cai (School of Software Engineering, South
China University of Technology)* |
585 |
MULTIMODAL
FUSION NETWORK WITH LATENT TOPIC MEMORY FOR RUMOR DETECTION |
Jiaxin
Chen (Guangdong University of Technology)*; Zekai Wu (Guangdong University of
Technology); Zhenguo Yang (Guangdong University of Technology); Haoran Xie
(Lingnan University); Fu Lee Wang (The Open University of Hong Kong); Wenyin
Liu (Guangdong University of Technology) |
887 |
DCNet:
Dual-task Cycle Network for End-to-End Image Dehazing |
Zhihua
Chen (East China University of Science and Technology); Yu Zhou (East China
University of Science and Technology); Ping Li (The Hong Kong Polytechnic
University); Xiaoyu Chi (Goertek Co., Ltd); Lei Ma (Peking University); Bin
Sheng (Shanghai Jiao Tong University)* |
1394 |
Person
Retrieval in Physical World |
Wenxin
Huang (Hubei University)*; Dongyang Li (Wuhan University); Ruimin Hu (Wuhan
University); Chao Liang (Wuhan University); Xian Zhong (Wuhan University of
Technology) |
43 |
Image
Captioning with Inherent Sentiment |
Tong
Li (Beijing Institute of Technology)*; Yunhui Hu (Beijing Institute of
Technology); Xinxiao Wu (Beijing Institute of Technology) |
|
|
|
|
|
|
O36 |
Industry
and Application Track II |
Time |
|
Chair |
Lukas
Esterle (Aarhus University) |
ID |
Title |
Author |
W124 |
EXTENDED
GUIDED IMAGE FILTERING FOR CONTRAST ENHANCEMENT |
Jiafei
Wu (SenseTime Research)*; Gengjie Li (SenseTime Research); Chong Wang (Ningbo
University); Huakai Liu (SenseTime Research); Shuai Zhang (SenseTime Ltd);
Guangcheng Zhang (SenseTime Research) |
W129 |
Fine-Grained
Texture Identification for Reliable Product Traceability |
Junsong
Wang (Easy-Visible)*; Yubo Li (V-Origin Technology); ZhiYong Chang (V-Origin
Technology); Haitao Yue (V-Origin Technology); Yonghua Lin (V-Origin
Technology) |
W132 |
A
LIGHTWEIGHT APPROACH FOR WOOD HYPERSPECTRAL IMAGES CLASSIFICATION |
Phyu
Phyu Htun (University of Computer Studies, Yangon)*; Marco Boschetti
(Microtec srl GmbH); Attaullah Buriro (University of Bolzano); Roberto
Confalonieri (Free University of Bozen-Bolzano); Boyuan Sun (Free University
of Bolzano); Ah Nge Htwe (University of Computer Studies, Yangon); Tammam
Tillo (Indraprastha Institute of Information Technology Delhi) |
W138 |
Low
Complexity Implementation of Intra String Copy in AVS3 |
Yingbin
Wang (Tencent); Xiaozhong Xu (Tencent America)*; Shan Liu (Tencent America) |
|
|
|