Details of Oral Session


O1 Image/Video Enhancement I
Time  
Chair Maggie Zhu (Purdue University)
ID Title Author
88 FGF-GAN: A Lightweight Generative Adversarial Network For Pansharpening Via Fast Guided Filter Zixiang Zhao (Xi’an Jiaotong University)*; Jiangshe Zhang (Xi'an Jiaotong University); Shuang Xu (Xi'an Jiaotong University); Kai Sun (Xi'an Jiaotong University); Lu Huang (Xi’an Jiaotong University); Junmin Liu (Xi'an Jiaotong University); Chunxia Zhang (Xi'an Jiaotong University)
253 Collaborative Reflectance-and-Illumination Learning for High-Efficient Low-light Image Enhancement Guijing Zhu (Dalian University of Technology); Long Ma (Dalian University of Technology); Risheng Liu (Dalian University of Technology)*; Xin Fan (Dalian University of Technology); Zhongxuan Luo (DALIAN UNIVERSITY OF TECHNOLOGY)
308 Organ-Branched-CNN for Robust Face Super-Resolution Jichun Li (Fudan University); Bahetiyaer Bare (Fudan University); Shili Zhou (Fudan University); Bo Yan (Fudan University)*; Ke Li (Fudan University)
350 Learning Long-Term Style Preserving Blind Video Temporal Consistency Hugo Thimonier (L'Oréal Research and Innovation)*; Julien Despois (L’Oréal Research and Innovation); Robin Kips (L'Oréal Research and Innovation); Matthieu Perrot ( L’Oréal Research and Innovation)
441 ISTA-Net++: Flexible Deep Unfolding Network for Compressive Sensing Di You (Peking University); Jingfen Xie (Peking University); Jian Zhang (Peking University Shenzhen Graduate School)*
456 Spatial Graph Convolutional Network for Image Super-Resolution Yue Yang (Xi’an Jiaotong University)*; Yong Qi (Xi’an Jiaotong University)
O2 Cross-modal and multi-modal multimedia analysis
Time  
Chair Bihan Wen (Nanyang Technological University)
ID Title Author
41 HIERARCHICAL REPRESENTATION NETWORK WITH AUXILIARY TASKS FOR VIDEOCAPTIONING Yu Lei (University of Electronic Science and Technology of China); Zhonghai He (UESTC)*; Pengpeng Zeng (University of Electronic Science and Technology of China); Jingkuan Song (UESTC); Lianli Gao (The University of Electronic Science and Technology of China)
115 Label-specific Alignment with Adversarial Multi-view Representation Yi Zhang (Nanjing University)*; Jundong Shen (Nanjing University); Cheng Yu ( Nanjing University); Chongjun Wang (Nanjing University)
214 Weakly-supervised Audio-visual Sound Source Detection and Separation Tanzila Rahman (University of British Columbia )*; Leonid Sigal (University of British Columbia)
799 Combine Early and Late Fusion Together: A Hybrid Fusion Framework for Image-Text Matching Yifan Wang (University of Electronic Science and Technology of China); Xing Xu (University of Electronic Science and Technology of China)*; Wei Yu (University of Electronic Science and Technology of China); Ruicong Xu (MEITUAN); Zuo Cao (MEITUAN); Heng Tao Shen (University of Electronic Science and Technology of China (UESTC))
1137 Tensor-based Multi-view Block-diagonal Structure Diffusion for Clustering Incomplete Multi-view Data Zhenglai Li (China University of Geosciences); Chang Tang (China University of Geosciences)*; Xinwang Liu (National University of Defense Technology); Xiao Zheng (National University of Defense Technology); Wei Zhang (Qilu University of Technology); En Zhu (National University of Defense Technology)
1389 Multi-Dimensional Attentive Hierarchical Graph Pooling Network for Video-Text Retrieval Dehao Wu (Peking University Shenzhen Graduate School)*; Yi Li (Peking University Shenzhen Graduate School); Yinghong Zhang (Peking University Shenzhen Graduate School); Yuesheng Zhu (Peking University Shenzhen Graduate School)
O3 Emerging applications of artificial intelligence
Time  
Chair Zhang Wei (Singapore Institute of Technology)
ID Title Author
566 Class Forge: Boosting Feature Encoder for Few-shot Learning with Synthesized Classes Rui-Qi Wang (Institute of Automation, Chinese Academy of Sciences)*; Xu-Yao Zhang (Institute of Automation of Chinese Academy of Sciences); Cheng-Lin Liu (Institute of Automation of Chinese Academy of Sciences)
568 GSS: Graph-based Subspace Learning with Shots Initialization for Few-shot Recognition Rui-Qi Wang (Institute of Automation, Chinese Academy of Sciences)*; Xu-Yao Zhang (Institute of Automation of Chinese Academy of Sciences); Cheng-Lin Liu (Institute of Automation of Chinese Academy of Sciences)
688 Truth Inference with Bipartite Attention Graph Neural Network from a Comprehensive View Jiacheng Liu (Shanghai Jiao Tong University); Feilong Tang (Shanghai Jiao Tong University)*; Jielong Huang (Alibaba Group)
714 Calibration for Non-exemplar based Class-incremental Learning Fei Zhu (Institute of Automation of Chinese Academy of Science)*; Xu-Yao Zhang (Institute of Automation of Chinese Academy of Sciences); Cheng-Lin Liu (Institute of Automation of Chinese Academy of Sciences)
746 Revisiting Graph Neural Networks for Node Classification in Heterogeneous Graphs Ye Tao (Peking University)*; Ying Li (Peking University); Zhonghai Wu (Peking University)
759 DDPER: Decentralized Distributed Prioritized Experience Replay Sidun Liu (NUDT); Peng Qiao (NUDT)*; Yong Dou (National University of Defense Technology); Rongchun Li (National Laboratory for Parallel and Distributed Processing, National University of Defense Technology,Changsha,Hunan)
O4 Multimedia databases and data mining
Time  
Chair Yueqi Duan (Stanford University)
ID Title Author
370 HAZY RE-ID: AN INTERFERENCE SUPPRESSION MODEL FOR DOMAIN ADAPTATION PERSON RE-IDENTIFICATION UNDER INCLEMENT WEATHER CONDITION Jian Pang (China University of Petroleum (East China)); Dacheng Zhang (Kunming University of Science and Technology); Huafeng Li (Kunming University of Science and Technology)*; Weifeng Liu (China University of Petroleum (East China)); Zhengtao Yu (Kunming University of Science and Technology)
440 Adaptive Deep Metric Ensemble Learning with Consensus Ping Li (Hangzhou Dianzi University)*; Guopan Zhao (Hangzhou Dianzi University); Huaxin Xiao (National University of Defense Technology)
682 Weakly-Supervised Online Hashing Yu-Wei Zhan (Shandong University); Xin  Luo (Shandong University)*; Yu Sun (Shandong University); Yongxin Wang (Shandong University); Zhen-Duo Chen (Shandong University); Xin-Shun Xu (Shandong University)
761 Deep Unsupervised Hashing by Distilled Smooth Guidance Xiao Luo (Peking University); Zeyu Ma (Harbin Institute of Technology, Shenzhen); Daqing Wu (Peking University); Huasong Zhong (Alibaba); Chong Chen (Alibaba); Jinwen Ma (Peking University); Minghua Deng (Peking University)*
647 Tensor-based Unsupervised Multi-view Feature Selection for Image Recognition Yongshan Zhang (China University of Geosciences)*; Xinxin Wang (China University of Geosciences); Zhihua Cai (China University of Geosciences); Yicong Zhou (University of Macau); Philip S Yu (UNIVERSITY OF ILLINOIS AT CHICAGO)
1129 Supervised Video Summarization via Multiple Feature Sets with Parallel Attention Junaid Ahmed Ghauri (TIB - Leibniz Information Centre for Science and Technology)*; Sherzod Hakimov (TIB - Leibniz Information Centre for Science and Technology); Ralph Ewerth (TIB - Leibniz Information Center for Science and Technology)
O5 Speech/audio synthesis and coding 
Time  
Chair Jahangir Alam (Computer Research Institute of Montreal)
ID Title Author
451 CROSS-DOMAIN SINGLE-CHANNEL SPEECH ENHANCEMENT MODEL WITH  BI-PROJECTION FUSION MODULE FOR NOISE-ROBUST ASR Fu-An Chao (National Taiwan Normal University)*; Jeih-weih Hung (National Chi Nan University); Berlin Chen (National Taiwan Normal University)
79 FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation Songxiang LIU (The Chinese University of Hong Kong)*; Yuewen Cao (CUHK); Na Hu (Tencent); Dan Su (Tencent); Helen Meng (The Chinese University of Hong Kong)
709 Spatial audio object coding based on time-frequency shifting and scheduling Chenhao Hu (wuhan university); Ruimin Hu (Wuhan University)*; Xiaochen Wang (Wuhan University); Yulin Wu (Wuhan University)
711 LOW BITRATES AUDIO OBJECT CODING USING CONVOLUTIONAL AUTO-ENCODER AND DENSENET MIXTURE MODEL Yulin Wu (Wuhan University); Ruimin Hu (Wuhan University)*; Chenhao Hu (wuhan university); Shanfa Ke (Wuhan University); Gang Li (Wuhan University); Xiaochen Wang (Wuhan University)
1022 Efficient multi-step audio object coding with limited residual information Chenhao Hu (wuhan university); Ruimin Hu (Wuhan University)*; Xiaochen Wang (Wuhan University); Yulin Wu (Wuhan University); Wenke Liu (Wuhan University)
964 Deep Speaker Conditioning for Speech Emotion Recognition Andreas Triantafyllopoulos (audEERING GmbH / University of Augsburg)*; Shuo Liu (University of Augsburg); Björn Schuller (University of Augsburg)
O6 Special Session: Deep Learning for Multimedia Applications with Limited Supervision
Time  
Chair Joey Tianyi Zhou (National University of Singapore)
ID Title Author
107 Near Real Feature Generative Network for Generalized Zero-Shot Learning Jingren Liu (Nanjing University of Science and Technology); Haoyue Bai (Nanjing University of Science and Technology); Haofeng Zhang (Nanjing University of Science and Technology)*; Li Liu (the inception institute of artificial  intelligence)
124 Saliency-Guided Complementary Attention for Improved Few-Shot Learning Linglan Zhao (Shanghai Jiao Tong University)*; Ge Liu (Shanghai Jiao Tong University); Da-shan Guo (Shanghai Jiao Tong University); Wei Li (Shanghai Jiao Tong University); Xiangzhong Fang (Shanghai Jiao Tong University)
271 Unsupervised Video Person Re-identification via Noise and Hard frame Aware Clustering Pengyu Xie (Wuhan University of Science and Technology); Xin Xu (Wuhan University of Science and Technology)*; Zheng Wang (The University of Tokyo); Toshihiko Yamasaki (The University of Tokyo)
298 Dual-regularization Complementary Learning for Image Classification Lingjuan Ge (Wuhan University); Mingming Gong (University of Melbourne); Yutian Lin (Wuhan University)*; Bo Du (Wuhan University)
411 Multi-domain Synchronous Refinement Network for Unsupervised Cross-Domain Person Re-Identification Sikai Bai ( Northwestern Polytechnical University); Junyu Gao (Northwestern Polytechnical University, Center for OPTical IMagery Analysis and Learning); Qi Wang (Northwestern  Polytechnical University)*; Xuelong Li (Northwestern Polytechnical University)
675 Few-Shot Defect Segmentation Leveraging Abundant Defect-free Training Samples Through Normal Background Regularization and Crop-and-Paste Operation Dongyun Lin (Institute for Infocomm Research)*; Yanpeng Cao (ZJU); Wenbin Zhu (Zhejiang University); Yiqun Li (Institute for Infocomm Research)
O7  Multimedia activity analysis and understanding
Time  
Chair Zhiyong Wang (The University of Sydney)
ID Title Author
80 Relationship-aware Primal-Dual Graph Attention Network for Scene Graph Generation Hao Zhou (National University of Defense Technology); Tingjin Luo (College of Liberal Arts and Sciences, National University of Defense Technology)*; Jun Zhang (Science and Technology on Information Systems Engineering Laboratory, National University of Defense Technology); Jun Lei (National University of Defense Technology); Shuohao LI (College of Information System and Management, National University of Defense Technology)
100 PAL-Net: Predicate-Aware Learning Network for Visual Relationship Recognition Liang Xu (Shanghai Jiao Tong University); Yong-Lu Li (Shanghai Jiao Tong University); Mingyang Chen (Shanghai Jiaotong University); Yan Hao (Shanghai Jiao Tong University); Cewu Lu (Shanghai Jiao Tong University)*
215 DIVING INTO THE RELATIONS: LEVERAGING SEMANTIC AND VISUAL STRUCTURES FOR VIDEO MOMENT RETRIEVAL Ziyue Wu (Student)*; Junyu Gao (CASIA); Shucheng Huang (Jiangsu University of Science and Technology); Changsheng Xu (CASIA)
563 Multimodal-Semantic Context-Aware Graph Neural Network for Group Activity Recognition Tianshan Liu (The Hong Kong Polytechnic University)*; Rui Zhao (The Hong Kong Polytechnic University ); Kin-Man Lam (The Hong Kong Polytechnic University)
676 Temporally Coarse to Fine Snippets Relationship Learning with Graph Convolution for Temporal Action Proposal Generation Shuaicheng 1 Li (Fudan University)*; Rui-Wei Zhao (Fudan University); Shuyu Miao (Fudan University); Rui Feng (Fudan University)
906 Recurrent Graph Convolutional Autoencoder for Unsupervised Skeleton-Based Action Recognition Han Yao (Tongji University); S-J Zhao (HaiBa Technology)*; Chi Xie (Tongji University); Kenan Ye (Tongji University); Shuang Liang (Tongji University)
O8 Image/Video Enhancement II
Time  
Chair Bihan Wen (Nanyang Technological University)
ID Title Author
741 Structure-Resonant Discriminator for Image Super-Resolution Jaerin Lee (Seoul National University)*; Kyoung Mu Lee (Seoul National University)
846 Asymmetric Stereo Color Transfer Yicheng Wang (University of Science and Technology of China); Jiayong Peng (University of Science and Technology of China); Yueyi Zhang (University of Science and Technology of China); Shan Liu (Tencent America); Xiaoyan Sun (University of Science and Technology of China); Zhiwei Xiong (University of Science and Technology of China)*
878 Residual Attention Block Search for Lightweight Image Super-Resolution Wenrui Liao (HFUT); Zhong-Qiu Zhao (HFUT)*; Hao Shen (HFUT); Weidong Tian (HFUT)
893 HALDeR: Hierarchical Attention-guided Learning with Detail-refinement for Multi-Exposure Image Fusion Jinyuan Liu (Dalian University of Technology); JingJie Shang (Dalian University of Technology); Risheng Liu (Dalian University of Technology); Xin Fan (Dalian University of Technology)*
1020 Deep Deblocker Driven Adaptive Iteration Scheme for Compressed Image Recovery Chao Ren (Sichuan University)*; Xiaohai He (Sichuan University); Linbo Qing (Sichuan University, China); Yuanzhouhan Cao (Beijing Jiaotong University)
1094 Structure-Oriented Progressive  Low-rank Image Restoration for Defending Adversarial Attacks Zhiqun Zhao (University of Missouri-Columbia); Hengyou Wang (Beijing University of Civil Engineering and Architecture); HAO SUN (University of Missouri-Columbia); Wenming Cao (Shenzhen University); Zhihai He (University of Missouri Columbia)*
O9  Multimedia representation learning
Time  
Chair Wei-Ta Chu (National Cheng Kung University)
ID Title Author
9 Fine-Grained Image Retrieval via Multiple Part-level Feature Ensemble Gang Cao (Shenzhen University); Yingying Zhu (Shenzhen University)*; Xiufan Lu (Shenzhen University)
290 Cross-View Equivariant Auto-Encoder Zhibin Wan (School of Intelligence and Computing, Tianjin University); Changqing Zhang (Tianjin university)*; Yu Geng (Tianjin University); Huazhu Fu (Inception Institute of Artificial Intelligence); Xi Peng (College of Computer Science, Sichuan Univerisity); Pengfei Zhu (tianjin university); Qinghua Hu (Tianjin University)
469 Noise Homogenization via Multi-Channel Wavelet Filtering for High-Fidelity Sample Generation in GANs Shaoning Zeng (Yangtze Delta Region Institute (Hu Zhou), University of Electronic Science and Technology of China)*; Bob Zhang (Univerisity of Macau)
471  Semantically-Guided Disentangled Representation for Robust Gait Recognition Tianrui Chai (Beihang University)*; Xinyu Mei (Beihang University); Annan Li (Beijing University of Aeronautics and Astronautics); Yunhong Wang (State Key Laboratory of Virtual Reality Technology and System, Beihang University, Beijing 100191, China)
480 Self-Guided Deep Multi-view Subspace Clustering Network Beilei Cui (Dalian University of Technology); Hong Yu (Dalian University of Technology)*; Linlin Zong (Dalian University of Technology); Ziyang Cheng (Dalian University Of Technology)
624 Efficient Sketch Recognition via Compact Spatial Embedding Graph Neural Networks Hanhui Li (Nanyang Technological University)*; Xudong Jiang (Nanyang Technological University); boliang guan (Sun Yat-sen University); Nadia  Magnenat Thalmann (Nanyang Technological University)
O10 3D stereo computing
Time  
Chair Shuai Li (Shandong University)
ID Title Author
127 Disparity Estimation with Scene Depth Cues lei chen (tsinghua university)*; Zongqing Lu (Tsinghua University international Graduate School at Shenzhen); Qingmin Liao (Tsinghua Univeristy); Haoyu Ma (Tsinghua University); Jing-Hao Xue (University College London)
225 Learning Depth from Single Image using Depth-Aware Convolution and Stereo Knowledge Zhenyao Wu (University of South Carolina)*; Xinyi Wu (University of South Carolina); Xiaoping Zhang (Wuhan University); Song Wang (University of South Carolina); Lili Ju (University of South Carolina)
295 Fast Multi-Scale Residual Fusion Network for Stereo Matching Zijing Huang ( Peking University Shenzhen Graduate School); Jun Peng (Peking University Shenzhen Graduate School); Wangduo Xie (Peking University Shenzhen Graduate School); Qiuping Li (Peking University Shenzhen Graduate School); Yong Zhao (Peking University Shenzhen Graduate School)*
399 TAG-Reg: Iterative Accurate Global Registration Algorithm Biao Li (Xi'an Jiaotong University); Qixing Xie (Xi'an Jiaotong University); Shaoyi Du (Xi'an Jiaotong Unviersity)*; Wenting Cui (Xi'an Jiaotong University); Runzhao Yao (Xi'an Jiaotong University); Yue Gao (Tsinghua University); nanning zheng (Institute of Artificial Intelligence and Robotics, Xi'an Jiaotong University )
780 Better stereo matching from simple yet effective wrangling of deep features lei chen (tsinghua university)*; Zongqing Lu (Tsinghua University international Graduate School at Shenzhen); Qingmin Liao (Tsinghua Univeristy); Jing-Hao Xue (University College London)
1352 AUTOMATIC CHECKERBOARD DETECTION FOR ROBUST CAMERA CALIBRATION Ben Chen (Huazhong University of Science and Technology; Alibaba Group)*; Yuyao Liu (Huazhong University of Science and Technology); Caihua Xiong (School of Mechanical Science and Engineering, Huazhong University of Science and Technology)
O11 Multimedia for society and health
Time  
Chair Liping Chen (Microsoft)
ID Title Author
1053 Sample Efficient Lung Segmentation using Group structured Conditional Variational Data Imputation Yan Li (East China Normal University); Guitao Cao (East China Normal University)*; Wenming Cao (Shenzhen University)
261 Integrating Performance and Side Factors into Embeddings for Deep Learning-Based Knowledge Tracing Liangliang He (National University of Defense Technology)*
857 unsupervised domain adaptation based image synthesis and synergistic adversarial learning for optic disc and cup segmentation Weixin Liu (Shenzhe University); Haijun  Lei  (Shenzhen University); Hai Xie (Shenzhen University); Benjian Zhao (Shenzhen University); Baiying Lei (Shenzhen University)*
65 Let's Find Fluorescein: Cross-Modal Dual Attention Learning for Fluorescein Leakage Segmentation in Fundus Fluorescein Angiography Yang Wen (School of Computer Science and Engineering, University of Electronic Science and Technology of China); Leiting Chen (School of Computer Science and Engineering, University of Electronic Science and Technology of China); Lifeng Qiao (University of Electronic Science and Technology of China); Yu Deng (King's College London); Haisheng Chen (University of Electronic Science and Technology of China); Tian Zhang (School of Computer Science and Engineering, University of Electronic Science and Technology of China); Chuan Zhou (School of Computer Science and Engineering, University of Electronic Science and Technology of China)*
704 Shape-Adaptive Convolutional Operator for Breast Ultrasound Image Segmentation Kuan Huang (Utah State University); Yingtao Zhang (Harbin Institute of Technology); H. D. Cheng (Utah State University)*; Ping Xing (First Affiliated Hospital of Harbin Medical University)
941 Bias Field Poses a Threat to DNN-based X-Ray Recognition Binyu Tian (Tianjin University); Qing Guo (Nanyang Technological University)*; Felix Juefei-Xu (Alibaba Group, USA); Wen Le Chan (Nanyang Technological University); Yupeng Cheng (Nanyang Technological University, Singapore); Xiaohong  Li (Tianjin University); Xiaofei Xie (Nanyang Technological University); Shengchao Qin (Teesside University)
O12 Special Session: Advancd Video Coding and Deep Active Learning
Time  
Chair Hui Yuan (Shandong University)
ID Title Author
710 SPLIT UNIT CODING ORDER FOR VIDEO CODING Yinji Piao (Samsung Electronics)*; Kiho Choi (Gachon Univerisity); Min Woo Park (Samsung Electronics); Minsoo Park (Samsung Electronics); Kwang Pyo Choi (Samsung Electronics)
787 IMPROVED CHROMA FROM LUMA PREDICTION IN AV1 BASED ON VIRTUAL CHROMA BLOCK GENERATION Junyan Huo (Xidian University)*; Menglin Zhang (Xidian University); Wenhan Qiao (Xidian University); FuZheng Yang (Xidian University); Hui Su (Google Inc.);  Debargha Mukherjee (Google Inc)
904 ANGULAR WEIGHTED PREDICTION FOR NEXT-GENERATION VIDEO CODING STANDARD Yucheng Sun (Hikvision Research Institute); Fangdong Chen (Hikvision Research Institute); Li Wang (Hikvision Research Institute); Shiliang Pu (Hikvision Research Institute)*
223 Meta-Learning Causal Feature Selection for Stable Prediction Zhaoquan Yuan (School of Computing and Artificial Intelligence, Southwest Jiaotong University); Xiao Peng (Southwest Jiaotong University); Xiao Wu (Southwest Jiaotong University)*; Bingkun Bao (Nanjing University of Posts and Telecommunications ); Changsheng Xu (CASIA)
1244 Application of Leading Indicator Forecasting based on Optimal Transmission in Financial Technology Tao Yin (Shanghai Jiao Tong University); Zhexi Zhang (Shanghai Jiao Tong University ); Nianchi Zhang (East China Normal University); Ning Zhang (Shanghai Jiao Tong University)*
1541 Multi-scale Enhanced Active Learning for Skeleton-based Action Recognition Yuhan Zhang (University of Electronic Science and Technology of China)*; Zhiyu Zhao (Nanjing University); Wen Li (University of Electronic Science and Technology of China); Lixin Duan (University of Electronic Science and Technology of China)
O13 Emerging multimedia applications
Time  
Chair Zheng Wang (The University of Tokyo)
ID Title Author
257 Capturing Implicit Spatial Cues for Monocular 3D Hand Reconstruction Qi Wu (Institute of Intelligent Machines,Chinese Academy of Sciences); Joya Chen (University of Science and Technology of China); zhou xu (Hefei Institutes of Physical Science,China Academy of Science); ZhiMing Yao (Hefei Institutes of Physical Science, Chinese Academy of Sciences); Xianjun Yang (Hefei Institutes of Physical Science, Chinese Academy of Sciences)*
1186 Efficient and Accurate Hypergraph Matching Jian Hou (Dongguan University of Technology)*; Huaqiang Yuan (Dongguan University of Technology)
801 Zero-shot Multi-Focus Image Fusion Xingyu Hu (Harbin Institute of Technology)*; Junjun Jiang (Harbin Institute of Technology); Xianming Liu (Harbin Institute of Technology); Jiayi Ma (Wuhan University)
1187 Attentive Update of Multi-Critic for Deep Reinforcement Learning Qing Li (USTC)*; Wengang  Zhou (University of Science and Technology of China); Yun Zhou (University of Science and Technology of China); Houqiang Li (University of Science and Technology of China)
1347 Small object recognition using a spatio-temporal neural network Zhibo Liang (Harbin Institute of Technology)*; Shaohui Liu (Harbin Institute of Technology); Wuzhen Shi (Shenzhen University); Xingtao Wang (Harbin Institute of Technology; Peng Cheng Laboratory); Feng Jiang (Harbin Institute of Technology, Harbin)
1565 Person Retrieval with Conv-Transformer Shengsen Wu (Peking University)*; YAN BAI (Peking University); Ce Wang (Peking University); Lingyu Duan (Peking University)
O14  Multimedia semantic segmentation
Time  
Chair Duc Thanh Nguyen (Deakin University)
ID Title Author
642 MULTI-SCALE FEEDBACK FEATURE REFINEMENT U-NET FOR MEDICAL IMAGE SEGMENTATION Xiaofei Qin (University of Shanghai for Science and  Technology); Minmin Xu (University of Shanghai for Science and Technology); Chaoyang Zheng (University of Shanghai for Science and Technology); Changxiang He (University of Shanghai for Science and  Technology); Xuedian Zhang (University of Shanghai for Science and  Technology)*
898 Document Layout Analysis via Dynamic Residual Feature Fusion Xingjiao Wu (East China Normal University); ZiLing Hu (East China Normal University); Xiangcheng Du (East China Normal University); Jing Yang (ECNU)*; Liang He (ECNU)
1109 SEMI-SUPERVISED SEMANTIC SEGMENTATION VIA ENTROPY MINIMIZATION Jiawei Wu (Fujian Agriculture and Forestry University); Haoyi Fan (Harbin University of Science and Technology); Xiaoqing Zhang (Minjiang University); Shouying Lin (Fujian Agriculture and Forestry University); Zuoyong   Li (Minjiang University)*
1184 EFRNET: A LIGHTWEIGHT NETWORK WITH EFFICIENT FEATURE FUSION AND REFINEMENT FOR REAL-TIME SEMANTIC SEGMENTATION Kuayue Zhang (Tsinghua University); Qingmin Liao (Tsinghua Univeristy); Juncheng Zhang (Tsinghua University); Shaojun Liu (Hong Kong University of Science and Technology)*; Haoyu Ma (Tsinghua University); Jing-Hao Xue (University College London)
1205 Weakly-Supervised Attribute Segmentation Guangzhen Liu (Renmin University of China); Zhiwu Lu (Renmin University of China)*
1518 CONFIDENCE-GUIDED ADAPTIVE GATE AND DUAL DIFFERENTIAL ENHANCEMENT FOR VIDEO SALIENT OBJECT DETECTION Pei-Jia Chen (Sun Yat-sen University); Jian-Huang Lai (Sun Yat-sen University)*; Guangcong Wang (Sun Yat-Sen University); Huajun Zhou (Sun Yat-sen University)
O15  Image/Video Synthesis and Creation I
Time  
Chair Tsung-Wei Huang (Dolby Labs)
ID Title Author
45 Semantic-Aware Video Color Style Transfer based on Temporal Consistent Sparse Patch Constraint Yaxin Liu (College of Computer Science and Software Engineering, Shenzhen University); Xiaoyan Zhang (College of Computer Science and Software Engineering, Shenzhen University)*; Xiaogang XU (The Chinese University of Hong Kong)
119 Learnable Sampling 3D Convolution for video enhancement and action recognition Shuyang Gu (University of Science and Technology of China)*; Jianmin Bao (Microsoft Research Asia); Dong Chen (Microsoft Research Asia)
137 ASTM: An Attention based SpatioTemporal Model for Video Prediction Using 3D Convolutional Neural Networks Zheng Chang (University of Chinese Academy of Sciences )*; xinfeng zhang (University of Chinese Academy of Sciences); Shanshe Wang (Peking University); Siwei Ma (Peking University, China); Yan Ye (Alibaba Inc.); Wen Gao (PKU)
191 Adversarial Adaptive Interpolation for Regularizing Representation Learning and Image Synthesis in AutoEncoders Guanyue Li (SCUT); Xiwen Wei (South China University of Technology); Sheng Qian (Huawei Device Company Limited); Si Wu (South China University of Technology)*; Zhiwen Yu (South China University of Technology); Hau San Wong (City University of Hong Kong)
220 Real-time Masked Face Revealing for Video Conference Jinpeng Lin (XiaMenUniversity); Pengfei Liu (School of Informatics, Xiamen University); Yinglin Zheng (School of Informatics, Xiamen University); Wenjin Deng (School of Informatics, Xiamen University); Ming Zeng (School of Informatics, Xiamen University)*
245 LI-NET: LARGE-POSE IDENTITY-PRESERVING FACE REENACTMENT NETWORK Jin Liu (1. Institute of Information Engineering,Chinese Academy of Sciences. 2. School of Cyber Security, University of Chinese Academy of Sciences); Peng Chen (1. Institute of Information Engineering,Chinese Academy of Sciences. 2. School of Cyber Security, University of Chinese Academy of Sciences); Tao Liang (1. Institute of Information Engineering,Chinese Academy of Sciences. 2. School of Cyber Security, University of Chinese Academy of Sciences); Zhaoxing Li (Institute of Information Engineering,Chinese Academy of Sciences); Cai Yu (1. Institute of Information Engineering,Chinese Academy of Sciences. 2. School of Cyber Security, University of Chinese Academy of Sciences); Shuqiao Zou (1. Institute of Information Engineering,Chinese Academy of Sciences. 2. School of Cyber Security, University of Chinese Academy of Sciences); Jiao Dai (Institute of Information Engineering,Chinese Academy of Sciences)*; Jizhong Han (Institute of Information Engineering,Chinese Academy of Sciences)
O16 Object/Person detection, Tracking and Recognition I
Time  
Chair Chunjie Zhang (Beijing Jiaotong University)
ID Title Author
72 PMAE: PSEUDO MULTI-LABEL ATTENTION ENSEMBLE Xueman Wang (Tiangong University); Ling Du (Tiangong University)*; Junbing Li (Tianjin University)
102 Improving Facial Attribute Recognition by Group and Graph Learning Zhenghao Chen (University of Sydney)*; Shuhang Gu (ETH Zurich, Switzerland); Feng Zhu (Sensetime Group Limited); Jing Xu (Sensetime Group Limited); Rui Zhao (Sensetime Group Limited)
144 DSIC: Dynamic Sample-Individualized Connector for Multi-scale Object Detection Zekun Li (Institute of automation, Chinese Academy of Sciences); Yufan Liu (Institute of Automation, Chinese Academy Sciences); Bing Li (National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences)*; Weiming Hu (Institute of Automation,Chinese Academy of Sciences); Yanan Miao (CNCERT); Hong Zhang (CNCERT)
368 Object Decoupling with Graph Correlation for Fine-Grained Image Classification Qiushi Guo (Alibaba Group)*; Mingchen Zhuge (China University of Geosciences); Dehong Gao (Alibaba Group); Huiling Zhou (Alibaba); Xin Wang (Alibaba Group); Xiaonan Meng (Alibaba Group)
428 Exploring Driving-aware Salient Object Detection via Knowledge Transfer Jinming Su (Beihang University); Changqun Xia (Peng Cheng Laboratory)*; Jia Li (Beihang University)
489 Hands-on Guidance for Distilling Object Detectors Yangyang Qin (Huazhong University of Science and Technology)*; Hefei Ling (Huazhong University of Science and Technology); Zhenghai He (Huazhong University of Science and Technology); Yuxuan Shi (Huazhong University of Science and Technology); Lei Wu (Huazhong University of Science and Technology)
O17 Emerging multimedia applications of deep learning I
Time  
Chair Wei Qi Yan (Auckland University of Technology)
ID Title Author
169 Enhancing Adversarial Examples Via Self-Augmentation Lifeng Huang (SunYat-sen university)*; Chengying Gao (Sun Yat-sen University ); Wenzi Zhuang (Sun Yat-sen University); Ning Liu (Sun Yat-sen University )
178 Unsupervised ensemble learning via network generation Zhongfan Zhang (South China University of Technology); Wenming CAO (The University of Hong Kong)*; Cheng Liu (Shantou University); Rui Li (City University of Hong Kong); Qianfen Jiao (City University of Hong Kong); Zhiwen Yu (South China University of Technology); C. L. Philip Chen  (South China University of Technology); Hau San Wong (City University of Hong Kong)
335 Learning to transfer under unknown noisy environments: an universal weakly-supervised domain adaptation method Xuan Liu (Hunan University); Ying Huang (Hunan University)*; Shichang He (Hunan University); Jiangjin Yin (Hunan University); Xinning Chen (Hunan University); Shigeng Zhang (Central South University)
651 Efficient training of lightweight neural networks using Online Self-Acquired Knowledge Distillation Maria Tzelepi (Aristotle University of Thessaloniki)*; ANASTASIOS TEFAS (Aristotle University of Thessaloniki)
763 Flexible Knowledge Distillation with an Evolutional Network Population Jie Lei (Zhejiang University Of Technology); Zhao Liu (Ping An Life Insurance Of China, Ltd.)*; Mingli Song (Zhejiang University); Juan Xu (Pingan Life Insurance of China); Jianping Shen (PingAn Life Insurance of China); Ronghua Liang (Zhejiang University of Technology)
838 Cooperative Learning for Noisy Supervision Hao Wu (Cooperative Medianet Innovation Center, Shanghai Jiao Tong University)*; Jiangchao Yao (Damo Academy, Alibaba Group); Ya Zhang (Cooperative Medianet Innovation Center, Shang hai Jiao Tong University); Yan-Feng Wang (Cooperative medianet innovation center of Shanghai Jiao Tong University)
O18  Multimedia security, privacy and forensic I
Time  
Chair Jun Wan (NLPR, CASIA)
ID Title Author
645 Multi-task Wavelet Corrected Network For Image Splicing Forgery Detection and Localization Xiuli Bi (Chongqing University of Posts and Telecommunications); Zhang Zhipeng (Chongqing university of post and telecommunications); Liu Yanbin (Chongqing University of Posts and Telecommunications); bin xiao (Chongqing University of Posts and Telecommunications)*; Weisheng  Li (Chongqing University of Posts and Telecommunications)
1454 Multi-Modality Image Manipulation Detection Chao Yang (Hunan University)*; Zhiyu Wang (Hunan University); Huawei Shen (Institute of Computing Technology, Chinese Academy of Sciences); Huizhou Li (Hunan University); Bin Jiang (Hunan University)
200 Video Abnormal Event Detection via Context Cueing Generative Adversarial Network Zhi Zhang (Shenzhen University); Sheng-hua Zhong (Shenzhen University)*; Yan Liu (The Hong Kong Polytechnic University)
247 Leveraging Intra-domain Knowledge to Strengthen Cross-domain Crowd Counting Yiqing Cai (East China Normal University); Lianggangxu Chen (East China Normal University); Zhenwei Ma (The Third Research Institute Of Ministry Of Public Security); Changhong lu (East China Normal University); Changbo Wang (East China Normal University); Gaoqi He (East China Normal University)*
282 DISCRIMINATIVE AND GEOMETRICALLY ROBUST ZERO-WATERMARKING SCHEME FOR PROTECTING DIBR 3D VIDEOS Xiyao Liu (Central South University); Yayun Zhang (Central South University); Sibo Du (Central South University); Jian Zhang (Central South University)*; Ming  Jiang ( Guilin University of Electronic Technology); Hui Fang (Loughborough University)
1037 H-StegoNet: A Hybrid Deep Learning Framework for Robust Steganalysis Soumik Mondal (A*STAR)*; Yeo  Sze Ling  (ASTAR-Institute for Infocomm Research, A*STAR); ArulMurugan Ambikapathi (ASTAR-Institute for Infocomm Research, A*STAR)
O19 Special Session: Advanced Representation Learning for Robust Multimedia Image Understanding
Time  
Chair Guangwei Gao (Nanjing University of Posts and Telecommunications)
ID Title Author
383 Learning Homogeneous and Heterogeneous Co-Occurrences for Unsupervised Cross-modal Retrieval Yang Zhao (Nanjing University of Science and Technology); Weiwei Wang (Nanjing University of Science and Technology); Haofeng Zhang (Nanjing University of Science and Technology)*; BingZhang Hu (Newcastle University)
643 Multimodal Transformer Networks with Latent Interaction for Audio-Visual Event Localization Yixuan He (University of Electronic Science and Technology of China); Xing Xu (University of Electronic Science and Technology of China)*; Xin Liu (Huaqiao University); Weihua Ou (Guizhou Normal University); Huimin Lu (Kyushu Institute of Technology)
921 Disentangling Prototype and Variation for Single Sample Face Recognition MENG PANG (Nanyang Technological University); Binghui Wang (Duke University); Mang YE (Wuhan University); Yiran Chen (Duke University); Bihan Wen (Nanyang Technological University)*
1178 Transferable Feature Learning on Graphs Across Visual Domains Ronghang Zhu (University of Georgia)*; Xiaodong Jiang (Facebook Inc); Jiasen Lu (Allen Institute for AI); Sheng Li (University of Georgia)
1452 Face Super-Resolution through Dual-identity Constraint Fangfang Cheng (Wuhan Institute of Technology)*; Tao Lu (Wuhan Institute of Technology); Yu Wang (Wuhan Institute of technology); Yanduo Zhang (Wuhan Institute of Technology)
O20 Multimedia Applications I
Time  
Chair Yongshan Zhang (University of Macau)
ID Title Author
908 DGD-NET: LOCAL DESCRIPTOR GUIDED KEYPOINT DETECTION NETWORK Xiaotao Liu (Tianjin University); Chen Meng (College of Intelligence and Computing, Tianjin University, China); Fei-Peng  Tian (Tianjin University); Wei Feng (College of Intelligence and Computing, Tianjin University, China)*
1335 Multi-view Tensor Clustering through Exploiting both Within-view and Across-view High-order Correlations haiyan wang (South China University of Technology); Guoqiang Han (South China University of Technology); Yu Hu (South China University of Technology); Hong Peng (South China University of Technology); Jiazhou Chen (South China University of Technology); Bin Zhang (South China University of Technology); Hongmin Cai (South China University of Technology)*
1484 Path Ranking Model For Entity Prediction xiao long (USTC); MingHong Yao (University of Science and Technology of China); Liansheng Zhuang (University of Science and Technology of China)*; Houqiang Li (University of Science and Technology of China)
1057 Learning efficient rotation representation for point cloud via local-global aggregation Ruibin Gu (South China University of Technology); Qiuxia Wu (South China University of Technology, China)*; Hongbin Xu (South China University of Technology); Wing W.Y. Ng (South China University of Technology); Zhiyong Wang (The University of Sydney)
371 Model Compression via Collaborative Data-free Knowledge Distillation for Edge Intelligence Zhiwei Hao (Beijing Institute of Technology)*; Yong Luo (Wuhan University); Zhi Wang (Tsinghua University); Han Hu (Beijing Institute of Technology, China); Jianping An (Beijing Institute of Technology)
O21 Object/Person detection, Tracking and Recognition II
Time    
Chair Yu Zhou (Institute of Information Engineering, CAS)
ID Title Author
547 Multi-view Face Recognition using Deep Attention-based Face Frontalization Xiao-Hu Shao (Chongqing Institute of Green and Intelligent Technology,Chinese Academy of Sciences; University of Chinese Academy of Sciences)*; Junliang Xing (Institute of Automation, Chinese Academy of Sciences); Ruihan Pan (Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences); Zhenghao Li (Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences); Xiang-Dong Zhou (Chongqing Institute of Green and Intelligent Technology,Chinese Academy of Sciences); Yu Shi (Chongqing Institute of Green and Intelligent Technology,Chinese Academy of Sciences)
899 CORE-Text: Improving Scene Text Detection with Contrastive Relational Reasoning Jingyang Lin (Sun Yat-Sen University); Yingwei Pan (JD AI Research)*; Rongfeng Lai (JD AI Research); Xuehang Yang (JD AI Research); Hongyang Chao (Sun Yat-sen University); Ting Yao (JD AI Research)
923 SSDL: Self-Supervised Dictionary Learning Shuai Shao (China University of Petroleum (East China) College of Control Science and Engineering); Lei Xing (China University of Petroleum(East China) College of Oceanography and Space Informatics); wei yu (Harbin Institute of Technology, School of computer science and technology); Rui Xu (China University of Petroleum (East China) College of Control Science and Engineering); yanjiang wang (China University of Petroleum (East China) College  of Control Science and Engineering); baodi liu (China University of Petroleum (East China) College of Information and Control Engineering)*
974 DeepMix: Online Auto Data Augmentation for Robust Visual Object Tracking Ziyi Cheng (Kyushu University); Xuhong Ren (School of Computer Science and Engineering, Tianjin University of Technology); Felix Juefei-Xu (Alibaba Group, USA); Wanli Xue (Tianjin University of Technology)*; Qing Guo (Nanyang Technological University); Lei Ma (University of Alberta); Jianjun Zhao (Kyushu University)
1006 MATTING ENHANCED MASK R-CNN Lufan Ma (Tsinghua University)*; Bin Dong (Southeast University); Jiangpeng Yan (Tsinghua University); Xiu Li (Tsinghua University)
1036 DEEP CORRELATION FILTERS FOR ROBUST VISUAL TRACKING Xiang Liu (Dongguan University  of  Technology)*
O22  Image/Video Synthesis and Creation II
Time  
Chair Ming-Ching Chang (University at Albany - SUNY)
ID Title Author
403 STAE: A SpatioTemporal Auto-Encoder for High-Resolution Video Prediction Zheng Chang (University of Chinese Academy of Sciences )*; xinfeng zhang (University of Chinese Academy of Sciences); Shanshe Wang (Peking University); Siwei Ma (Peking University, China); Yan Ye (Alibaba Inc.); Wen Gao (PKU)
439 FEW-SHOT KNOWLEDGE TRANSFER FOR FINE-GRAINED CARTOON FACE GENERATION Nan Zhuang (Peking University)*; Cheng Yang (ByteDance Inc.)
817 BargainNet: Background-Guided Domain Translation for Image Harmonization Wenyan Cong (Shanghai Jiao Tong University); Li Niu (Shanghai Jiao Tong University)*; Jianfu Zhang (RIKEN AIP;Shanghai Jiao Tong University); Jing Liang (Shanghai Jiao Tong University); Liqing Zhang (Shanghai Jiao Tong University)
1160 DNA-NET: AGE AND GENDER AWARE KIN FACE SYNTHESIZER Pengyu Gao (Southeast University); Joseph P Robinson (Northeastern University); Jiaxuan Zhu (Southeast University); Chao Xia (Shanghai Jiao Tong University); Ming Shao (University of Massachusetts Dartmouth); Siyu Xia (Southeast University, China)*
1163 Spatial Content Alignment For Pose Transfer Wing Yin Yu (CITY UNIVERSITY OF HONG KONG)*; Lai-Man Po (CITY UNIVERSITY OF HONG KONG); Yuzhi Zhao (City University of Hong Kong); Jingjing Xiong (CITY UNIVERSITY OF HONG KONG); Kin Wai Lau (CITYU UNIVERSITY OF HONG KONG)
1339 INFRARED AND VISIBLE IMAGE FUSION BASED ON MODAL FEATURE FUSION NETWORK AND DUAL VISUAL DECISION Yong Yang (School of Information Technology, Jiangxi University of Finance and Economics); Jiaxiang Liu (School of Information Technology, Jiangxi University of Finance and Economics)*; Shuying Huang (School of Software and Communication Engineering, Jiangxi University of Finance and Economics); Weiguo Wan (School of Software and Communication Engineering, Jiangxi University of Finance and Economics); Xiangkai Kong (School of Information Technology, Jiangxi University of Finance and Economics); Wang Zhang ( School of Information Technology, Jiangxi University of Finance and Economics)
O23  Multimedia analysis and understanding I
Time  
Chair Bingpeng Ma (University of Chinese Academy of Sciences)
ID Title Author
937 Cross-scene Person Trajectory Anomaly Detection Based on Re-Identification Yuanxun Li (Sun Yat-sen University, China); Ancong Wu (Sun Yat-sen University); WEI-SHI ZHENG (Sun Yat-sen University, China)*
1073 ACTION PREDICTION NETWORK WITH AUXILIARY OBSERVATION RATIO REGRESSION Cuiwei Liu (Shenyang Aerospace University)*; Yiming Gao (Shenyang Aerospace University); Zhaokui Li (Shenyang Aerospace University); Chong Du (Shenyang Aircraft Design and Research Institute); Fang Liu (Shenyang Aerospace University;Northeastern University); Xiangbin Shi (Shenyang Aerospace University)
1082 GAIT IDENTIFICATION BASED ON HUMAN SKELETON WITH PAIRWISE GRAPH CONVOLUTIONAL NETWORK Ke Xu (Shanghai Jiao Tong University)*; Xinghao Jiang (Shanghai Jiao Tong University); Tanfeng Sun (Shanghai Jiao Tong University)
1119 spatial reasoning and context-aware attention network for skeleton-based action recognition Dianlong You (yanshan university); Ling Wang (yanshan university)*; Da Han (Cardiff University); Shunpan Liang (yanshan university); Hongyang Liu (yanshan university); Fuyong Yuan (yanshan university)
1525 Edge Enhancement Network for Weakly Supervised Semantic Segmentation Mei Yu (Tianjin University); Junbin Wei (Tianjin University); Chenhan Wang ( Laboratory of OpenBayes Machine Intelligence Lab); Han Jiang (Laboratory of OpenBayes Machine Intelligence Lab); Jian Yu (Tianjin University); Ruixuan Zhang (College of Intelligence and Computing, Tianjin University); Xuewei Li (Tianjin University)*; Ruiguo Yu (Tianjin University)
1587 Associative Segmentation for Instances and Semantics by perceiving neighborhood in Point Clouds Yingying Zhu (Shenzhen University); Biao Li (Shenzhen University); Qiang Huang (Shenzhen University)*
O24 Multimedia interaction & Multimedia quality assessment
Time  
Chair Jong-Seok LEE (Yonsei University)
ID Title Author
959 FINE-GRAINED DISCOURSE FOR METAPHOR DETECTION qimeng yang (xinjiang university)*; Long Yu (Xinjiang University); Shengwei Tian (Xinjiang University); jinmiao song (Xinjiang University)
1128 Facial Chirality: Using self-face reflection to learn discriminative features for facial expression recognition Ling Lo ( National Chiao Tung University); Hong Xia Xie (National Chiao Tung University); Hong-Han Shuai (National Chiao Tung University); Wen-Huang Cheng (National Chiao Tung University)*
106 SKANET: STRUCTURED KNOWLEDGE-AWARE NETWORK FOR VISUAL DIALOG Lei Zhao (The University of Electronic Science and Technology of China); Lianli Gao (The University of Electronic Science and Technology of China)*; Yuyu Guo (UESTC); Jingkuan Song (UESTC); Heng Tao Shen (University of Electronic Science and Technology of China (UESTC))
755 A No-reference Evaluation Metric for Low-light Image Enhancement Zicheng Zhang (Shanghai Jiaotong university)*; Wei Sun (Shanghai Jiao Tong Unviersity); Xiongkuo Min (Shanghai Jiao Tong University); Wenhan Zhu (Shanghai Jiao Tong University); Tao Wang (ShanghaiJiaotongUniversity); Wei Lu (Shanghai Jiao Tong University); Guangtao Zhai (Shanghai Jiao Tong University)
1158 DEEP NEURAL NETWORKS FOR END-TO-END SPATIOTEMPORAL VIDEO QUALITY PREDICTION AND AGGREGATION Junming Chen (Peking University); Haiqiang Wang (Pengcheng Laboratory); Munan Xu (Shenzhen Graduate School, Peking University); Ge Li (SECE, Shenzhen Graduate School, Peking University)*; Shan Liu (Tencent America)
1465 No-Reference Deep Quality Assessment of Compressed Light Field Images Zixuan Guo (Peking University); Wei Gao (Peking University & Peng Cheng Laboratory)*; Haiqiang Wang (Pengcheng Laboratory); Junle Wang (Tencent); Songlin Fan (Peking University )
O25  Multimedia security, privacy and forensic II
Time  
Chair Liang He (Tsinghua University)
ID Title Author
18 Blind Adversarial Pruning: Towards the Comprehensive Robust Models with Gradually Pruning Against Blind Adversarial Attacks Haidong Xie (Qian Xuesen Laboratory, China Academy of Space Technology); Lixin Qian ( Wuhan University of Technology); Xueshuang Xiang (Qian Xuesen Laboratory of Space Technology)*; Naijin Liu (Qian Xuesen Laboratory, China Academy of Space Technology)
695 EFFICIENT OPEN-SET ADVERSARIAL ATTACKS ON DEEP FACE RECOGNITION Haojie Yuan (University of Science and Technology of China); Qi Chu (University of Science and Technology of China)*; Feng Zhu (University of Science and Technology of China); Rui Zhao (SenseTime Group Limited); Bin Liu (University of Science and Technology of China); Nenghai Yu (University of Science and Technology of China)
920 CONTENT-INDEPENDENT ONLINE HANDWRITING VERIFICATION BASED ON MULTI-MODAL FUSION Nan Ji (School of Cyberspace Security, University of Science and Technology of China); Bin Liu (University of Science and Technology of China)*; Zhiwei Zhao (University of Science and Technology of China); Yan Lu (University of Sydney); Qi Chu (University of Science and Technology of China); Zhenchao Jin (University of Science and Technology of China); Nenghai Yu (University of Science and Technology of China)
1324 On Generating JPEG Adversarial Images Mengte Shi (Fudan University); Sheng Li (Fudan University); Zhaoxia Yin (Anhui University); Xinpeng Zhang (School of Computer Science, Fudan University)*; Zhenxing Qian (School of Computer Science, Fudan University)
1459 Transferable Adversarial Examples for Anchor Free Object Detection quanyu liao (Chengdu University of Information Technology); Xin Wang (Keya Medical); bin kong (curacloud); Siwei Lyu (University at Buffalo); Bin Zhu (Microsoft Research Asia); youbing yin (Curacloud); qi  song (Curacloud); Xi Wu (Chengdu University of Information Technology)*
O26 Special Session: Recent Advance in Depth-Related Processing and Applications
Time  
Chair Runmin Cong (Beijing Jiaotong University)
ID Title Author
105 SN-Graph: a Minimalist 3D Object Representation for Classification Siyu Zhang (Donghua University); Hui Cao (Donghua University); Yuqi Liu (Donghua University); Shen Cai (Donghua University)*; Yanting Zhang (Donghua University); Yuanzhan Li (Donghua University); Xiaoyu Chi (Goertek Co., Ltd)
259 Stereo Superpixel Segmentation via Dual-attention Fusion Networks Ruiqi Wu (Wuhan University of Technology); Yajuan Du (Wuhan University of Technology); Hua Li (Huazhong University of Science and Technology; City University of Hong Kong)*; Yucong Dai (Wuhan University of Technology)
278 IRS: A Large Naturalistic Indoor Robotics Stereo Dataset to Train Deep Models for Disparity and Surface Normal Estimation Qiang Wang (Hong Kong Baptist University)*; Shizhen Zheng (HKBU); Qingsong Yan (Wuhan University); Fei Deng (Wuhan University); Kaiyong Zhao (Hong Kong Baptist University); Xiaowen Chu (Hong Kong Baptist University)
1162 QoE-based Neural Live Streaming Method With Continuous Dynamic Adaptive Video Quality Control Xuekai WEI (City University of Hong Kong); Mingliang Zhou (Chongqing University)*; Sam Kwong (City Univeristy of Hong Kong); Hui Yuan (Shandong University); Tao Xiang (Chongqing University)
1345 DUAL REGULARIZATION BASED DEPTH MAP SUPER-RESOLUTION WITH GRAPH LAPLACIAN PRIOR Longhua Sun (Beijing University of Technology); Jin Wang (Beijing University of Technology)*; Ruiqin Xiong (Peking University); Yunhui Shi (Beijing University of Technology); Qing Zhu (Beijing University of Technology); Baocai  Yin (Beijing University of Technology)
O27 Image/Video Enhancement III
Time  
Chair Chau-Wai Wong (North Carolina State University)
ID Title Author
1174 Image demoireing with a dual-domain distilling network Hailing Wang (Tianjin University); Qiaoyu Tian (Tianjin University); Liang Li (Tianjin University)*; Xiaojie Guo (Tianjin University)
1176 Contrastive Feature Decomposition for Image Reflection Removal Xin Feng (Harbin Institute of Technology, Shenzhen); Haobo Ji (Harbin Institute of Technology,Shenzhen); Bo Jiang (Harbin Institute of Technology Shenzhen); Wenjie Pei (Harbin Institute of Technology, Shenzhen); Fanglin Chen ( Harbin Institute of Technology, Shenzhen); Guangming Lu ( Harbin Institute of Technology, Shenzhen)*
1189 RGB GUIDED DEPTH MAP SUPER-RESOLUTION WITH COUPLED U-NET Yingjie Cui (Tsinghua University); Qingmin Liao (Tsinghua Univeristy)*; Wenming Yang (Tsinghua University); Jing-Hao Xue (University College London)
1374 Blur Invariant Kernel-Adaptive Network for Single Image Blind Deblurring Sungkwon An (Seoul National University ); Hyungmin Roh (Seoul National University); Myungjoo Kang (Seoul National University)*
1373 STRUCTURAL PRIOR GUIDED IMAGE INPAINTING FOR COMPLEX SCENE Shuxin Wei (Sun Yat-sen University); Chengying Gao (Sun Yat-sen University )*
1517 BWIN: A Bilateral Warping Method for Video Frame Interpolation Fanyong Xue (Shanghai Jiao Tong University); Jie Li (Shanghai Jiao Tong University)*; Jiannan Liu (Shanghai Jiao Tong University); Chentao Wu (Shanghai Jiao Tong University)
O28  Multimedia analysis and understanding II
Time  
Chair Liping Chen (Microsoft)
ID Title Author
953 A lightweight Saliency Prediction Model for Omnidirectional Images dandan zhu (Shanghai Jiao Tong University)*; yongqing chen ( Hainan Air Traffic Management Sub-Bureau); Defang Zhao (Tongji University); Xiongkuo Min (Shanghai Jiao Tong University); Qiangqiang Zhou (Jiangxi Normal University); Shaobo Yu (East China Normal University); Guangtao Zhai (Shanghai Jiao Tong University); Xiaokang Yang (Shanghai Jiao Tong University)
1277 Multi-Scale Attention Constraint Network for Fine-Grained Visual Classification Yaqing Hou (Dalian University of Technology)*; zhang wenkai (Dalian University of Technology); dongsheng zhou (dlu.edu.cn); Hongwei Ge (Dalian University of Technology); Qiang Zhang (Dalian University of Technology); Xiaopeng Wei (Dalian University of Technology)
1314 Multiple Hub-driven Attention Graph Network for Scene Graph Generation Yang Yao (Sun Yat-sen University)*; Bo Gu (Sun Yat-sen University)
1105 HRDNet: High-resolution Detection Network for Small Objects Ziming Liu (Inria); Guangyu Ryan Gao (Beijing Institute of Technology)*; Lin Sun (Samsung, USA); zhiyuan fang (Beijing Institute of Technology )
1157 Meta-Graph Adaptation for Visual Object Tracking Qiangqiang Wu (City University of Hong Kong); Antoni Chan (City University of Hong Kong, Hong, Kong)*
1288 CUTMIX DUAL BRANCH NETWORK FOR PERSON RE-IDENTIFICATION Zengming Tang (Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, China)*; Jun Huang (Shanghai Advanced Research Institute, Chinese Academy of Sciences)
O29 Emerging multimedia applications of deep learning II
Time    
Chair Maggie Zhu (Purdue University)
ID Title Author
895 DEEP TIERED IMAGE SEGMENTATION FOR DETECTING INTERNAL ICE LAYERS IN RADAR IMAGERY Yuchen Wang (Indiana University)*; Mingze Xu (Amazon); John Paden (University of Kansas); Lora Koenig (Univeristy of Colorado); Geoffrey  Charles Fox (Indiana University); David Crandall (Indiana University)
947 ATTENTION DRIVEN SELF-SIMILARITY CAPTURE FOR MOTION DEBLURRING Jie Zhang (School of Computer Science, Fudan University); Chuanfa Zhang (Fudan University); Jiangzhou Wang (School of Computer Science, Fudan University); Qingyue Xiong (Fudan University); Yingtao Zhang (School of Computer Science, Fudan University); Wenqiang Zhang (Fudan University)*
986 Wide-sense Stationary Policy Optimization with Bellman Residual on Video Games Chen Gong (Institute of Automation, Chinese Academy of Sciences)*; Qiang He (Institute of Automation, Chinese Academy of Sciences); Yunpeng Bai (Institute of Automation, Chinese Academy of Sciences); Xinwen Hou (Institute of Automation, Chinese Academy of Sciences); Guoliang Fan (Institute of Automation, Chinese Academy of Sciences); Yu Liu (Institute of Automation, Chinese Academy of Sciences)
1076 ACSNet: Adaptive Cross-scale Network with Feature Maps Refusion for Vehicle Density Detection Zuhao Ge (Shantou University); Yuhui Li (Shantou University); Cheng Liang (Shantou University); Youyi Song (The Hong Kong Polytechnic University); Teng Zhou (Shantou University)*; Jing Qin (The Hong Kong Polytechnic University)
1150 Unsupervised Domain Adaptation via Cluster Alignment with Maximum Classifier Discrepancy Mohamed Azzam (City University of Hong Kong); Si Wu (South China University of Technology); Aurele Tohokantche Gnanha  (City University of Hong Kong); Qianfen Jiao (City University of Hong Kong); Hau San Wong (City University of Hong Kong)*
1495 LIDAR-BASED REAL-TIME MAPPING FOR DIGITAL TWIN DEVELOPMENT Evan Brock (University of Tennessee at Chattanooga); Chengxuan Huang (University of California, Davis); Dalei Wu (University of Tennessee at Chattanooga)*; Yu Liang (University of Tennessee at Chattanooga)
O30 Multimedia Applications II
Time  
Chair Xinggong Zhang (Peking University)
ID Title Author
204 CoConv: Learning Dynamic Cooperative Convolution for Image Recognition Kien X Nguyen (Texas Christian University); Tiffany Ryu (University of North Texas); Jocelyn Zhang (University of North Texas); Xu Ma (University of North Texas)*; Qing Yang (University of North Texas); Song Fu (University of North Texas); Paparao Palacharla (Fujitsu Network Communications); Nannan Wang (Fujitsu Network Communications); Xi Wang (Fujitsu Network Communications)
1192 DRL-based Collaborative Edge Content Replication with Popularity Distillation Haopeng Yan (Tsinghua University); Zeming Chen (Tsinghua University); Zhi Wang (Tsinghua University); Wenwu Zhu (Tsinghua University)*
656 Handwriting Trajectory Recovery from Off-Line Multi-Stroke Characters by Deep Ordering Prediction and Heuristic Search Tie-Qiang Wang (CASIA)*; Cheng-Lin Liu (Institute of Automation of Chinese Academy of Sciences)
239 CNN-Based Depth Map Prediction for Fast Block Partitioning in HEVC Intra Coding Aolin Feng (University of Science and Technology of China); Changsheng Gao (University of Science and Technology of China); Li Li (University of Science and Technology of China); Dong Liu (University of Science and Technology of China)*; Feng Wu (University of Science and Technology of China)
1463 A REAL-TIME H.266/VVC SOFTWARE DECODER Bin Zhu (Tencent America); Shan Liu (Tencent America); Yuan Liu (Tencent America); Yi Luo (Tencent America); Jing Ye (Tencent America); Haiyan Xu (Tencent America); Ying Huang (Tencent America); Hualong Jiao (Tencent America); Xiaozhong Xu (Tencent America)*; Xianguo Zhang (Tencent); Chenchen Gu (Tencent)
16 On Forecasting Dynamics in Online Discussion Forums Chen Ling (University of Delaware); Di Cui (University of Delaware); Guangmo Tong (University of Delaware)*; Jianming ZHU (University of Chinese Academy of Sciences)
O31 Special Session: Multimedia Knowledge-Driven Deep Analysis and Forensics/Security
Time  
Chair Chang-Tsun Li (Deakin University)
ID Title Author
209 Robust Image Denoising with Texture-Aware Neural Network Bo Fu (Dalian University of Technology)*; Liyan Wang (Liaoning Normal University); Zhongxuan Luo (DALIAN UNIVERSITY OF TECHNOLOGY)
555 Multi-Graph Based Hierarchical Semantic Fusion for Cross-Modal Representation Lei Zhu ()*; Chengyuan Zhang (Hunan University); Jiayu Song (Central South University); Liangcheng Liu (UniversityofMelbourne); Shichao Zhang (); Yangding Li (Hunan Normal University)
96 ON CONSTRUCTING A BETTER CORRELATION PREDICTOR FOR PRNU-BASED IMAGE FORGERY LOCALIZATION Xufeng Lin (Deakin University)*; Chang-Tsun Li (Deakin University, Australia)
337 VideoForensicsHQ: Detecting High-quality Manipulated Face Videos Gereon Fox (Max Planck Institute for Informatics)*; Wentao Liu (Max Planck Institute for Informatics); Hyeongwoo Kim (Max Planck Institute for Informatics); Hans-Peter Seidel (Max Planck Institute for Informatics); Mohamed Elgharib (Max Planck Institute for Informatics); Christian Theobalt (MPI Informatik)
1185 DEFAKEHOP: A LIGHT-WEIGHT HIGH-PERFORMANCE DEEPFAKE DETECTOR Hong-Shuo Chen (USC)*; Mozhdeh Rouhsedaghat (University of Southern California); Hamza H Ghani (USC); Shuowen Hu (US Army Research Laboratory); Suya You (U.S. Army Research Laboratory); C.-C. Jay Kuo (USC)
O32 Industry and Application Track I
Time  
Chair Zhang Wei (Singapore Institute of Technology)
ID Title Author
W11 MT-GAN: A Training Framework to Enhance Image Classification Task with Image Translation Qun Li (Microsoft)*; Changbo Hu (Microsoft); Keng-hao Chang (Microsoft); Ruofei Zhang (Microsoft)
W90 A time-variant QoE model based on real video streaming data Shengbin Meng (ByteDance Inc.)*; Minyin Zeng (ByteDance Inc.); Junlin Li (ByteDance Inc.); Yue Wang (Beijing ByteDance Technology Co., Ltd.); Zongming Guo (Peking University)
W111 Hardware-aware Model Optimization Tool For Embedded Devices Cagri Ozcinar (Samsung)*; Dongsun Kim (Samsung R&D Institute UK); Ben Rufus Duckworth (Samsung R&D Institute UK); Shayan Joya (Samsung R&D Institute UK); Nicolas  Scotto Di Perto (Samsung R&D Institute UK); Attila Dusnoki (University of Szeged); Márkó Fabó (University of Szeged ); Dániel Vince (University of Szeged); Gábor Lóki (University of Szeged ); Ákos Kiss (University of Szeged); Christopher Alder (Samsung R&D Institute UK)
W117 MULTI-MODAL FUSION ENHANCED MODEL FOR DRIVER’S FACIAL EXPRESSION RECOGNITION Jianrong Chen (University of California, San Diego)*; Sujit Dey (University of California, San Diego); Lei Wang (Qualcomm); Ning Bi (Qualcomm); Peng Liu (Qualcomm)
O33 Image/video acquisition and compression
Time  
Chair Xin Zhao (Tencent)
ID Title Author
698 Thousand to One: Semantic Prior Modeling for Conceptual Coding Jianhui Chang (Peking University)*; Zhenghui Zhao (Peking University); Lingbo Yang (Peking University); Chuanmin Jia (Peking University); Jian Zhang (Peking University Shenzhen Graduate School); Siwei Ma (Peking University, China)
1063 Spatial-Temporal Synergic Prior Driven Unfolding Network for Snapshot Compressive Imaging Zhuoyuan Wu (PKU)*; Zhenyu Zhang (PKU); Jiechong Song (PKU); Jian Zhang (Peking University Shenzhen Graduate School)
1068 EFFICIENT VIDEO COMPRESSED SENSING RECONSTRUCTION VIA EXPLOITING SPATIAL-TEMPORAL CORRELATION WITH MEASUREMENT CONSTRAINT Zhichao Wei (South China University of Technology)*; Chunling Yang (South China University of Technology ); Yunyi Xuan (South China University of Technology)
1369 Enhanced Implicit Selection of Transform Skip in AVS3 liqiang wang (Tencent)*; Xiaozhong Xu (Tencent America); Shan Liu (Tencent America)
205 VANet: A View Attention Guided Network for 3D Reconstruction from Single and Multi-view Images Yi Yuan (NetEase Fuxi AI Lab)*; Jilin Tang (NetEase Fuxi AI Lab); Zhengxia Zou (University of Michigan)
226 DIFFERENTIABLE LIGHT-WEIGHT ARCHITECTURE SEARCH Yuxu Mao (Ocean University of China); Guoqiang Zhong (Ocean University of China)*; Yanan Wang (Ocean University of China); Zhaoyang Deng (Ocean University of China)
O34 Multimedia analysis and understanding III
Time  
Chair Ming-Ching Chang (University at Albany - SUNY)
ID Title Author
1399 MPN: Multimodal Parallel Network for Audio-Visual Event Localization Jiashuo Yu (Fudan University)*; Ying Cheng (Fudan University); Rui Feng (Fudan University)
1488 Learning Content and Context with Language Bias for Visual Question Answering Chao Yang (Hunan University)*; Su Feng (Hunan University); Dongsheng Li (Microsoft Research Asia); Huawei Shen (Institute of Computing Technology, Chinese Academy of Sciences); Guoqing Wang (Hunan University); Bin Jiang (Hunan University)
627 Efficient Human Pose Estimation by Learning Deeply Aggregated Representations Zhengxiong Luo (Institute of Automation,Chinese Academy of Sciences)*; Zhicheng Wang (Megvii); Yuanhao Cai (Tsinghua Univisity, Tsinghua Shenzhen International Graduate School); Guan'an Wang (CASIA); Liang Wang (NLPR, China); Yan Huang (Institute of Automation, Chinese Academy of Sciences); Erjin Zhou (Megvii Research); Tieniu Tan (NLPR, China); Jian Sun (Megvii Technology)
1180 An Efficient Approach for Audio-Visual Emotion Recognition with Missing Labels and Missing Modalities Fei Ma (Tsinghua-Berkeley Shenzhen Institute, Tsinghua University)*; Shao-Lun Huang (TBSI); Lin Zhang (Tsinghua University, China)
1478 ConSK-GCN: Conversational Semantic- and Knowledge-oriented  Graph Convolutional Network for Multimodal Emotion Recognition  Yahui Fu (Tianjin University)*; Shogo Okada (Japan Advanced Institute of Science and Technology); Longbiao Wang (Tianjin University); Lili Guo (Tianjin University); Yaodong Song (Tianjin University); Jiaxing Liu (Tianjin University); Jianwu Dang (Tianjin University)
O35 Special Session: Advances in Language, Vision, and Limited Supervision
Time  
Chair Yi Cai (South China University of Technology)
ID Title Author
459 MNRE: A Challenge Multimodal Dataset for Neural Relation Extraction with Visual Evidence in Social Media Posts Changmeng Zheng (South China University of Technology); Zhiwei Wu (School of Software Engineering, South China University of Technology); Junhao Feng (South China University of Technology); Ze Fu (School of Software Engineering, South China University of Technology); Yi Cai (School of Software Engineering, South China University of Technology)*
585 MULTIMODAL FUSION NETWORK WITH LATENT TOPIC MEMORY FOR RUMOR DETECTION jiaxin chen (Guangdong University of Technology)*; Zekai Wu (Guangdong University of Technology ); Zhenguo Yang (Guangdong University of Technology); Haoran Xie (Lingnan University); Fu Lee Wang (The Open University of Hong Kong); Wenyin Liu (Guangdong University of Technology)
887 DCNet: Dual-task Cycle Network for End-to-End Image Dehazing Zhihua Chen (East China University of Science and Technology); Yu Zhou (East China University of Science and Technology); Ping Li (The Hong Kong Polytechnic University); Xiaoyu Chi (Goertek Co., Ltd); Lei Ma (Peking University); Bin Sheng (Shanghai Jiao Tong University)*
1394 Person Retrieval in Physical World Wenxin Huang (Hubei University)*; Dongyang Li (Wuhan University); Ruimin Hu (Wuhan University); Chao Liang (Wuhan University); Xian Zhong (Wuhan University of Technology)
43 Image Captioning with Inherent Sentiment tong li (Beijing Institute of Technology)*; yunhui hu (Beijing Institute of Technology); Xinxiao Wu (Beijing Institute of Technology)
O36 Industry and Application Track II
Time  
Chair Lukas Esterle (Aarhus University)
ID Title Author
W124 EXTENDED GUIDED IMAGE FILTERING FOR CONTRAST ENHANCEMENT JIAFEI WU (SenseTime Research)*; Gengjie Li (SenseTime Research); Chong Wang (Ningbo University); Huakai Liu (SenseTime Research); shuai zhang (Sensetime Ltd); Guangcheng Zhang (SenseTime Research)
W129 Fine-Grained Texture Identification for Reliable Product Traceability Junsong Wang (Easy-Visible)*; Yubo Li (V-Origin Technology); ZhiYong Chang (V-Origin Technology); Haitao Yue,(V-Origin Technology); Yonghua Lin (V-Origin Technology)
W132 A LIGHTWEIGHT APPROACH FOR WOOD HYPERSPECTRAL IMAGES CLASSIFICATION Phyu Phyu Htun (University of Computer Studies, Yangon.)*; Marco Boschetti (Microtec srl GmbH); Attaullah Buriro (University of Bolzano); Roberto Confalonieri (Free University of Bozen-Bolzano); Boyuan Sun (Free University of Bolzano); Ah Nge Htwe (University of Computer Studies, Yangon.); Tammam Tillo (Indraprastha Institute of Information Technology Delhi)
W138 Low Complexity Implementation of Intra String Copy in AVS3 Yingbin Wang (Tencent); Xiaozhong Xu (Tencent America)*; Shan Liu (Tencent America)