| O1 | Image/Video Enhancement I | 
 
  | Time |  | 
 
  | Chair | Maggie Zhu (Purdue University) | 
 
  | ID | Title | Author | 
 
  | 88 | FGF-GAN:
  A Lightweight Generative Adversarial Network For Pansharpening Via Fast
  Guided Filter | Zixiang
  Zhao (Xi’an Jiaotong University)*; Jiangshe Zhang (Xi'an Jiaotong
  University); Shuang Xu (Xi'an Jiaotong University); Kai Sun (Xi'an Jiaotong
  University); Lu Huang (Xi’an Jiaotong University); Junmin Liu (Xi'an Jiaotong
  University); Chunxia Zhang (Xi'an Jiaotong University) | 
 
  | 253 | Collaborative
  Reflectance-and-Illumination Learning for High-Efficient Low-light Image
  Enhancement | Guijing
  Zhu (Dalian University of Technology); Long Ma (Dalian University of
  Technology); Risheng Liu (Dalian University of Technology)*; Xin Fan (Dalian
  University of Technology); Zhongxuan Luo (DALIAN UNIVERSITY OF TECHNOLOGY) | 
 
  | 308 | Organ-Branched-CNN
  for Robust Face Super-Resolution | Jichun
  Li (Fudan University); Bahetiyaer Bare (Fudan University); Shili Zhou (Fudan
  University); Bo Yan (Fudan University)*; Ke Li (Fudan University) | 
 
  | 350 | Learning
  Long-Term Style Preserving Blind Video Temporal Consistency | Hugo
  Thimonier (L'Oréal Research and Innovation)*; Julien Despois (L’Oréal
  Research and Innovation); Robin Kips (L'Oréal Research and Innovation);
  Matthieu Perrot (	L’Oréal Research and Innovation) | 
 
  | 441 | ISTA-Net++:
  Flexible Deep Unfolding Network for Compressive Sensing | Di
  You (Peking University); Jingfen Xie (Peking University); Jian Zhang (Peking
  University Shenzhen Graduate School)* | 
 
  | 456 | Spatial
  Graph Convolutional Network for Image Super-Resolution | Yue
  Yang (Xi’an Jiaotong University)*; Yong Qi (Xi’an Jiaotong University) | 
 
  |  |  |  | 
 
  | O2 | Cross-modal and multi-modal multimedia analysis | 
 
  | Time |  | 
 
  | Chair | Bihan Wen (Nanyang Technological University) | 
 
  | ID | Title | Author | 
 
  | 41 | HIERARCHICAL
  REPRESENTATION NETWORK WITH AUXILIARY TASKS FOR VIDEOCAPTIONING | Yu
  Lei (University of Electronic Science and Technology of China); Zhonghai He
  (UESTC)*; Pengpeng Zeng (University of Electronic Science and Technology of
  China); Jingkuan Song (UESTC); Lianli Gao (The University of Electronic
  Science and Technology of China) | 
 
  | 115 | Label-specific
  Alignment with Adversarial Multi-view Representation | Yi
  Zhang (Nanjing University)*; Jundong Shen (Nanjing University); Cheng Yu
  (	Nanjing University); Chongjun Wang (Nanjing University) | 
 
  | 214 | Weakly-supervised
  Audio-visual Sound Source Detection and Separation | Tanzila
  Rahman (University of British Columbia )*; Leonid Sigal (University of
  British Columbia) | 
 
  | 799 | Combine
  Early and Late Fusion Together: A Hybrid Fusion Framework for Image-Text
  Matching | Yifan
  Wang (University of Electronic Science and Technology of China); Xing Xu
  (University of Electronic Science and Technology of China)*; Wei Yu
  (University of Electronic Science and Technology of China); Ruicong Xu
  (MEITUAN); Zuo Cao (MEITUAN); Heng Tao Shen (University of Electronic Science
  and Technology of China (UESTC)) | 
 
  | 1137 | Tensor-based
  Multi-view Block-diagonal Structure Diffusion for Clustering Incomplete
  Multi-view Data | Zhenglai
  Li (China University of Geosciences); Chang Tang (China University of
  Geosciences)*; Xinwang Liu (National University of Defense Technology); Xiao
  Zheng (National University of Defense Technology); Wei Zhang (Qilu University
  of Technology); En Zhu (National University of Defense Technology) | 
 
  | 1389 | Multi-Dimensional
  Attentive Hierarchical Graph Pooling Network for Video-Text Retrieval | Dehao
  Wu (Peking University Shenzhen Graduate School)*; Yi Li (Peking University
  Shenzhen Graduate School); Yinghong Zhang (Peking University Shenzhen
  Graduate School); Yuesheng Zhu (Peking University Shenzhen Graduate School) | 
 
  |  |  |  | 
 
  | O3 | Emerging applications of artificial
  intelligence | 
 
  | Time |  | 
 
  | Chair | Zhang Wei (Singapore Institute of Technology) | 
 
  | ID | Title | Author | 
 
  | 566 | Class
  Forge: Boosting Feature Encoder for Few-shot Learning with Synthesized
  Classes | Rui-Qi
  Wang (Institute of Automation, Chinese Academy of Sciences)*; Xu-Yao Zhang
  (Institute of Automation of Chinese Academy of Sciences); Cheng-Lin Liu
  (Institute of Automation of Chinese Academy of Sciences) | 
 
  | 568 | GSS:
  Graph-based Subspace Learning with Shots Initialization for Few-shot
  Recognition | Rui-Qi
  Wang (Institute of Automation, Chinese Academy of Sciences)*; Xu-Yao Zhang
  (Institute of Automation of Chinese Academy of Sciences); Cheng-Lin Liu
  (Institute of Automation of Chinese Academy of Sciences) | 
 
  | 688 | Truth
  Inference with Bipartite Attention Graph Neural Network from a Comprehensive
  View | Jiacheng
  Liu (Shanghai Jiao Tong University); Feilong Tang (Shanghai Jiao Tong
  University)*; Jielong Huang (Alibaba Group) | 
 
  | 714 | Calibration
  for Non-exemplar based Class-incremental Learning | Fei
  Zhu (Institute of Automation of Chinese Academy of Science)*; Xu-Yao Zhang
  (Institute of Automation of Chinese Academy of Sciences); Cheng-Lin Liu
  (Institute of Automation of Chinese Academy of Sciences) | 
 
  | 746 | Revisiting
  Graph Neural Networks for Node Classification in Heterogeneous Graphs | Ye
  Tao (Peking University)*; Ying Li (Peking University); Zhonghai Wu (Peking
  University) | 
 
  | 759 | DDPER:
  Decentralized Distributed Prioritized Experience Replay | Sidun
  Liu (NUDT); Peng Qiao (NUDT)*; Yong Dou (National University of Defense
  Technology); Rongchun Li (National Laboratory for Parallel and Distributed
  Processing, National University of Defense Technology,Changsha,Hunan) | 
 
  |  |  |  | 
 
  | O4 | Multimedia databases and data mining | 
 
  | Time |  | 
 
  | Chair | Yueqi Duan (Stanford University) | 
 
  | ID | Title | Author | 
 
  | 370 | HAZY
  RE-ID: AN INTERFERENCE SUPPRESSION MODEL FOR DOMAIN ADAPTATION PERSON
  RE-IDENTIFICATION UNDER INCLEMENT WEATHER CONDITION | Jian
  Pang (China University of Petroleum (East China)); Dacheng Zhang (Kunming
  University of Science and Technology); Huafeng Li (Kunming University of
  Science and Technology)*; Weifeng Liu (China University of Petroleum (East
  China)); Zhengtao Yu (Kunming University of Science and Technology) | 
 
  | 440 | Adaptive
  Deep Metric Ensemble Learning with Consensus | Ping
  Li (Hangzhou Dianzi University)*; Guopan Zhao (Hangzhou Dianzi University);
  Huaxin Xiao (National University of Defense Technology) | 
 
  | 682 | Weakly-Supervised
  Online Hashing | Yu-Wei
  Zhan (Shandong University); Xin 
  Luo (Shandong University)*; Yu Sun (Shandong University); Yongxin Wang
  (Shandong University); Zhen-Duo Chen (Shandong University); Xin-Shun Xu
  (Shandong University) | 
 
  | 761 | Deep
  Unsupervised Hashing by Distilled Smooth Guidance | Xiao
  Luo (Peking University); Zeyu Ma (Harbin Institute of Technology, Shenzhen);
  Daqing Wu (Peking University); Huasong Zhong (Alibaba); Chong Chen (Alibaba);
  Jinwen Ma (Peking University); Minghua Deng (Peking University)* | 
 
  | 647 | Tensor-based
  Unsupervised Multi-view Feature Selection for Image Recognition | Yongshan
  Zhang (China University of Geosciences)*; Xinxin Wang (China University of
  Geosciences); Zhihua Cai (China University of Geosciences); Yicong Zhou
  (University of Macau); Philip S Yu (UNIVERSITY OF ILLINOIS AT CHICAGO) | 
 
  | 1129 | Supervised
  Video Summarization via Multiple Feature Sets with Parallel Attention | Junaid
  Ahmed Ghauri (TIB - Leibniz Information Centre for Science and Technology)*;
  Sherzod Hakimov (TIB - Leibniz Information Centre for Science and
  Technology); Ralph Ewerth (TIB - Leibniz Information Center for Science and
  Technology) | 
 
  |  |  |  | 
 
  | O5 | Speech/audio synthesis and coding | 
 
  | Time |  | 
 
  | Chair | Jahangir Alam (Computer Research Institute of
  Montreal) | 
 
  | ID | Title | Author | 
 
  | 451 | CROSS-DOMAIN
  SINGLE-CHANNEL SPEECH ENHANCEMENT MODEL WITH  BI-PROJECTION FUSION MODULE FOR
  NOISE-ROBUST ASR | Fu-An
  Chao (National Taiwan Normal University)*; Jeih-weih Hung (National Chi Nan
  University); Berlin Chen (National Taiwan Normal University) | 
 
  | 79 | FastSVC:
  Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear
  Modulation | Songxiang
  LIU (The Chinese University of Hong Kong)*; Yuewen Cao (CUHK); Na Hu
  (Tencent); Dan Su (Tencent); Helen Meng (The Chinese University of Hong Kong) | 
 
  | 709 | Spatial
  audio object coding based on time-frequency shifting and scheduling | Chenhao
  Hu (wuhan university); Ruimin Hu (Wuhan University)*; Xiaochen Wang (Wuhan
  University); Yulin Wu (Wuhan University) | 
 
  | 711 | LOW
  BITRATES AUDIO OBJECT CODING USING CONVOLUTIONAL AUTO-ENCODER AND DENSENET
  MIXTURE MODEL | Yulin
  Wu (Wuhan University); Ruimin Hu (Wuhan University)*; Chenhao Hu (wuhan
  university); Shanfa Ke (Wuhan University); Gang Li (Wuhan University);
  Xiaochen Wang (Wuhan University) | 
 
  | 1022 | Efficient
  multi-step audio object coding with limited residual information | Chenhao
  Hu (wuhan university); Ruimin Hu (Wuhan University)*; Xiaochen Wang (Wuhan
  University); Yulin Wu (Wuhan University); Wenke Liu (Wuhan University) | 
 
  | 964 | Deep
  Speaker Conditioning for Speech Emotion Recognition | Andreas
  Triantafyllopoulos (audEERING GmbH / University of Augsburg)*; Shuo Liu
  (University of Augsburg); Björn Schuller (University of Augsburg) | 
 
  |  |  |  | 
 
  | O6 | Special
  Session: Deep Learning for Multimedia Applications with Limited Supervision | 
 
  | Time |  | 
 
  | Chair | Joey
  Tianyi Zhou (National University of Singapore) | 
 
  | ID | Title | Author | 
 
  | 107 | Near
  Real Feature Generative Network for Generalized Zero-Shot Learning | Jingren
  Liu (Nanjing University of Science and Technology); Haoyue Bai (Nanjing
  University of Science and Technology); Haofeng Zhang (Nanjing University of
  Science and Technology)*; Li Liu (the inception institute of artificial  intelligence) | 
 
  | 124 | Saliency-Guided
  Complementary Attention for Improved Few-Shot Learning | Linglan
  Zhao (Shanghai Jiao Tong University)*; Ge Liu (Shanghai Jiao Tong
  University); Da-shan Guo (Shanghai Jiao Tong University); Wei Li (Shanghai
  Jiao Tong University); Xiangzhong Fang (Shanghai Jiao Tong University) | 
 
  | 271 | Unsupervised
  Video Person Re-identification via Noise and Hard frame Aware Clustering | Pengyu
  Xie (Wuhan University of Science and Technology); Xin Xu (Wuhan University of
  Science and Technology)*; Zheng Wang (The University of Tokyo); Toshihiko
  Yamasaki (The University of Tokyo) | 
 
  | 298 | Dual-regularization
  Complementary Learning for Image Classification | Lingjuan
  Ge (Wuhan University); Mingming Gong (University of Melbourne); Yutian Lin
  (Wuhan University)*; Bo Du (Wuhan University) | 
 
  | 411 | Multi-domain
  Synchronous Refinement Network for Unsupervised Cross-Domain Person
  Re-Identification | Sikai
  Bai (	Northwestern Polytechnical University); Junyu Gao (Northwestern
  Polytechnical University, Center for OPTical IMagery Analysis and Learning);
  Qi Wang (Northwestern 
  Polytechnical University)*; Xuelong Li (Northwestern Polytechnical
  University) | 
 
  | 675 | Few-Shot
  Defect Segmentation Leveraging Abundant Defect-free Training Samples Through
  Normal Background Regularization and Crop-and-Paste Operation | Dongyun
  Lin (Institute for Infocomm Research)*; Yanpeng Cao (ZJU); Wenbin Zhu
  (Zhejiang University); Yiqun Li (Institute for Infocomm Research) | 
 
  |  |  |  | 
 
  | O7 | Multimedia activity analysis and
  understanding | 
 
  | Time |  | 
 
  | Chair | Zhiyong
  Wang (The University of Sydney) | 
 
  | ID | Title | Author | 
 
  | 80 | Relationship-aware
  Primal-Dual Graph Attention Network for Scene Graph Generation | Hao
  Zhou (National University of Defense Technology); Tingjin Luo (College of
  Liberal Arts and Sciences, National University of Defense Technology)*; Jun
  Zhang (Science and Technology on Information Systems Engineering Laboratory,
  National University of Defense Technology); Jun Lei (National University of
  Defense Technology); Shuohao LI (College of Information System and
  Management, National University of Defense Technology) | 
 
  | 100 | PAL-Net:
  Predicate-Aware Learning Network for Visual Relationship Recognition | Liang
  Xu (Shanghai Jiao Tong University); Yong-Lu Li (Shanghai Jiao Tong
  University); Mingyang Chen (Shanghai Jiaotong University); Yan Hao (Shanghai
  Jiao Tong University); Cewu Lu (Shanghai Jiao Tong University)* | 
 
  | 215 | DIVING
  INTO THE RELATIONS: LEVERAGING SEMANTIC AND VISUAL STRUCTURES FOR VIDEO
  MOMENT RETRIEVAL | Ziyue
  Wu (Student)*; Junyu Gao (CASIA); Shucheng Huang (Jiangsu University of
  Science and Technology); Changsheng Xu (CASIA) | 
 
  | 563 | Multimodal-Semantic
  Context-Aware Graph Neural Network for Group Activity Recognition | Tianshan
  Liu (The Hong Kong Polytechnic University)*; Rui Zhao (The Hong Kong
  Polytechnic University	); Kin-Man Lam (The Hong Kong Polytechnic University) | 
 
  | 676 | Temporally
  Coarse to Fine Snippets Relationship Learning with Graph Convolution for
  Temporal Action Proposal Generation | Shuaicheng
  1 Li (Fudan University)*; Rui-Wei Zhao (Fudan University); Shuyu Miao (Fudan
  University); Rui Feng (Fudan University) | 
 
  | 906 | Recurrent
  Graph Convolutional Autoencoder for Unsupervised Skeleton-Based Action
  Recognition | Han
  Yao (Tongji University); S-J Zhao (HaiBa Technology)*; Chi Xie (Tongji
  University); Kenan Ye (Tongji University); Shuang Liang (Tongji University) | 
 
  |  |  |  | 
 
  | O8 | Image/Video
  Enhancement II | 
 
  | Time |  | 
 
  | Chair | Bihan
  Wen (Nanyang Technological University) | 
 
  | ID | Title | Author | 
 
  | 741 | Structure-Resonant
  Discriminator for Image Super-Resolution | Jaerin
  Lee (Seoul National University)*; Kyoung Mu Lee (Seoul National University) | 
 
  | 846 | Asymmetric
  Stereo Color Transfer | Yicheng
  Wang (University of Science and Technology of China); Jiayong Peng
  (University of Science and Technology of China); Yueyi Zhang (University of
  Science and Technology of China); Shan Liu (Tencent America); Xiaoyan Sun
  (University of Science and Technology of China); Zhiwei Xiong (University of
  Science and Technology of China)* | 
 
  | 878 | Residual
  Attention Block Search for Lightweight Image Super-Resolution | Wenrui
  Liao (HFUT); Zhong-Qiu Zhao (HFUT)*; Hao Shen (HFUT); Weidong Tian (HFUT) | 
 
  | 893 | HALDeR:
  Hierarchical Attention-guided Learning with Detail-refinement for
  Multi-Exposure Image Fusion | Jinyuan
  Liu (Dalian University of Technology); JingJie Shang (Dalian University of
  Technology); Risheng Liu (Dalian University of Technology); Xin Fan (Dalian
  University of Technology)* | 
 
  | 1020 | Deep
  Deblocker Driven Adaptive Iteration Scheme for Compressed Image Recovery | Chao
  Ren (Sichuan University)*; Xiaohai He (Sichuan University); Linbo Qing
  (Sichuan University, China); Yuanzhouhan Cao (Beijing Jiaotong University) | 
 
  | 1094 | Structure-Oriented
  Progressive  Low-rank Image
  Restoration for Defending Adversarial Attacks | Zhiqun
  Zhao (University of Missouri-Columbia); Hengyou Wang (Beijing University of
  Civil Engineering and Architecture); HAO SUN (University of
  Missouri-Columbia); Wenming Cao (Shenzhen University); Zhihai He (University
  of Missouri Columbia)* | 
 
  |  |  |  | 
 
  | O9 | Multimedia representation learning | 
 
  | Time |  | 
 
  | Chair | Wei-Ta
  Chu (National Cheng Kung University) | 
 
  | ID | Title | Author | 
 
  | 9 | Fine-Grained
  Image Retrieval via Multiple Part-level Feature Ensemble | Gang
  Cao (Shenzhen University); Yingying Zhu (Shenzhen University)*; Xiufan Lu
  (Shenzhen University) | 
 
  | 290 | Cross-View
  Equivariant Auto-Encoder | Zhibin
  Wan (School of Intelligence and Computing, Tianjin University); Changqing
  Zhang (Tianjin university)*; Yu Geng (Tianjin University); Huazhu Fu
  (Inception Institute of Artificial Intelligence); Xi Peng (College of
  Computer Science, Sichuan Univerisity); Pengfei Zhu (tianjin university);
  Qinghua Hu (Tianjin University) | 
 
  | 469 | Noise
  Homogenization via Multi-Channel Wavelet Filtering for High-Fidelity Sample
  Generation in GANs | Shaoning
  Zeng (Yangtze Delta Region Institute (Hu Zhou), University of Electronic
  Science and Technology of China)*; Bob Zhang (Univerisity of Macau) | 
 
  | 471 | Semantically-Guided Disentangled
  Representation for Robust Gait Recognition | Tianrui
  Chai (Beihang University)*; Xinyu Mei (Beihang University); Annan Li (Beijing
  University of Aeronautics and Astronautics); Yunhong Wang (State Key
  Laboratory of Virtual Reality Technology and System, Beihang University,
  Beijing 100191, China) | 
 
  | 480 | Self-Guided
  Deep Multi-view Subspace Clustering Network | Beilei
  Cui (Dalian University of Technology); Hong Yu (Dalian University of
  Technology)*; Linlin Zong (Dalian University of Technology); Ziyang Cheng
  (Dalian University Of Technology) | 
 
  | 624 | Efficient
  Sketch Recognition via Compact Spatial Embedding Graph Neural Networks | Hanhui
  Li (Nanyang Technological University)*; Xudong Jiang (Nanyang Technological
  University); boliang guan (Sun Yat-sen University); Nadia  Magnenat Thalmann (Nanyang
  Technological University) | 
 
  |  |  |  | 
 
  | O10 | 3D
  stereo computing | 
 
  | Time |  | 
 
  | Chair | Shuai
  Li (Shandong University) | 
 
  | ID | Title | Author | 
 
  | 127 | Disparity
  Estimation with Scene Depth Cues | lei
  chen (tsinghua university)*; Zongqing Lu (Tsinghua University international
  Graduate School at Shenzhen); Qingmin Liao (Tsinghua Univeristy); Haoyu Ma
  (Tsinghua University); Jing-Hao Xue (University College London) | 
 
  | 225 | Learning
  Depth from Single Image using Depth-Aware Convolution and Stereo Knowledge | Zhenyao
  Wu (University of South Carolina)*; Xinyi Wu (University of South Carolina);
  Xiaoping Zhang (Wuhan University); Song Wang (University of South Carolina);
  Lili Ju (University of South Carolina) | 
 
  | 295 | Fast
  Multi-Scale Residual Fusion Network for Stereo Matching | Zijing
  Huang ( 	Peking University Shenzhen Graduate School); Jun Peng (Peking
  University Shenzhen Graduate School); Wangduo Xie (Peking University Shenzhen
  Graduate School); Qiuping Li (Peking University Shenzhen Graduate School);
  Yong Zhao (Peking University Shenzhen Graduate School)* | 
 
  | 399 | TAG-Reg:
  Iterative Accurate Global Registration Algorithm | Biao
  Li (Xi'an Jiaotong University); Qixing Xie (Xi'an Jiaotong University);
  Shaoyi Du (Xi'an Jiaotong Unviersity)*; Wenting Cui (Xi'an Jiaotong
  University); Runzhao Yao (Xi'an Jiaotong University); Yue Gao (Tsinghua
  University); nanning zheng (Institute of Artificial Intelligence and
  Robotics, Xi'an Jiaotong University	) | 
 
  | 780 | Better
  stereo matching from simple yet effective wrangling of deep features | lei
  chen (tsinghua university)*; Zongqing Lu (Tsinghua University international
  Graduate School at Shenzhen); Qingmin Liao (Tsinghua Univeristy); Jing-Hao
  Xue (University College London) | 
 
  | 1352 | AUTOMATIC
  CHECKERBOARD DETECTION FOR ROBUST CAMERA CALIBRATION | Ben
  Chen (Huazhong University of Science and Technology; Alibaba Group)*; Yuyao
  Liu (Huazhong University of Science and Technology); Caihua Xiong (School of
  Mechanical Science and Engineering, Huazhong University of Science and
  Technology) | 
 
  |  |  |  | 
 
  | O11 | Multimedia
  for society and health | 
 
  | Time |  | 
 
  | Chair | Liping
  Chen (Microsoft) | 
 
  | ID | Title | Author | 
 
  | 1053 | Sample
  Efficient Lung Segmentation using Group structured Conditional Variational
  Data Imputation | Yan
  Li (East China Normal University); Guitao Cao (East China Normal
  University)*; Wenming Cao (Shenzhen University) | 
 
  | 261 | Integrating
  Performance and Side Factors into Embeddings for Deep Learning-Based
  Knowledge Tracing | Liangliang
  He (National University of Defense Technology)* | 
 
  | 857 | unsupervised
  domain adaptation based image synthesis and synergistic adversarial learning
  for optic disc and cup segmentation | Weixin
  Liu (Shenzhe University); Haijun 
  Lei  (Shenzhen University);
  Hai Xie (Shenzhen University); Benjian Zhao (Shenzhen University); Baiying
  Lei (Shenzhen University)* | 
 
  | 65 | Let's
  Find Fluorescein: Cross-Modal Dual Attention Learning for Fluorescein Leakage
  Segmentation in Fundus Fluorescein Angiography | Yang
  Wen (School of Computer Science and Engineering, University of Electronic
  Science and Technology of China); Leiting Chen (School of Computer Science
  and Engineering, University of Electronic Science and Technology of China);
  Lifeng Qiao (University of Electronic Science and Technology of China); Yu
  Deng (King's College London); Haisheng Chen (University of Electronic Science
  and Technology of China); Tian Zhang (School of Computer Science and
  Engineering, University of Electronic Science and Technology of China); Chuan
  Zhou (School of Computer Science and Engineering, University of Electronic
  Science and Technology of China)* | 
 
  | 704 | Shape-Adaptive
  Convolutional Operator for Breast Ultrasound Image Segmentation | Kuan
  Huang (Utah State University); Yingtao Zhang (Harbin Institute of
  Technology); H. D. Cheng (Utah State University)*; Ping Xing (First
  Affiliated Hospital of Harbin Medical University) | 
 
  | 941 | Bias
  Field Poses a Threat to DNN-based X-Ray Recognition | Binyu
  Tian (Tianjin University); Qing Guo (Nanyang Technological University)*;
  Felix Juefei-Xu (Alibaba Group, USA); Wen Le Chan (Nanyang Technological
  University); Yupeng Cheng (Nanyang Technological University, Singapore);
  Xiaohong  Li (Tianjin University);
  Xiaofei Xie (Nanyang Technological University); Shengchao Qin (Teesside
  University) | 
 
  |  |  |  | 
 
  | O12 | Special
  Session: Advancd Video Coding and Deep Active Learning | 
 
  | Time |  | 
 
  | Chair | Hui
  Yuan (Shandong University) | 
 
  | ID | Title | Author | 
 
  | 710 | SPLIT
  UNIT CODING ORDER FOR VIDEO CODING | Yinji
  Piao (Samsung Electronics)*; Kiho Choi (Gachon Univerisity); Min Woo Park
  (Samsung Electronics); Minsoo Park (Samsung Electronics); Kwang Pyo Choi
  (Samsung Electronics) | 
 
  | 787 | IMPROVED
  CHROMA FROM LUMA PREDICTION IN AV1 BASED ON VIRTUAL CHROMA BLOCK GENERATION | Junyan
  Huo (Xidian University)*; Menglin Zhang (Xidian University); Wenhan Qiao
  (Xidian University); FuZheng Yang (Xidian University); Hui Su (Google
  Inc.);  Debargha Mukherjee (Google
  Inc) | 
 
  | 904 | ANGULAR
  WEIGHTED PREDICTION FOR NEXT-GENERATION VIDEO CODING STANDARD | Yucheng
  Sun (Hikvision Research Institute); Fangdong Chen (Hikvision Research
  Institute); Li Wang (Hikvision Research Institute); Shiliang Pu (Hikvision
  Research Institute)* | 
 
  | 223 | Meta-Learning
  Causal Feature Selection for Stable Prediction | Zhaoquan
  Yuan (School of Computing and Artificial Intelligence, Southwest Jiaotong
  University); Xiao Peng (Southwest Jiaotong University); Xiao Wu (Southwest
  Jiaotong University)*; Bingkun Bao (Nanjing University of Posts and
  Telecommunications ); Changsheng Xu (CASIA) | 
 
  | 1244 | Application
  of Leading Indicator Forecasting based on Optimal Transmission in Financial
  Technology | Tao
  Yin (Shanghai Jiao Tong University); Zhexi Zhang (Shanghai Jiao Tong
  University ); Nianchi Zhang (East China Normal University); Ning Zhang
  (Shanghai Jiao Tong University)* | 
 
  | 1541 | Multi-scale
  Enhanced Active Learning for Skeleton-based Action Recognition | Yuhan
  Zhang (University of Electronic Science and Technology of China)*; Zhiyu Zhao
  (Nanjing University); Wen Li (University of Electronic Science and Technology
  of China); Lixin Duan (University of Electronic Science and Technology of
  China) | 
 
  |  |  |  | 
 
  | O13 | Emerging
  multimedia applications | 
 
  | Time |  | 
 
  | Chair | Zheng
  Wang (The University of Tokyo) | 
 
  | ID | Title | Author | 
 
  | 257 | Capturing
  Implicit Spatial Cues for Monocular 3D Hand Reconstruction | Qi
  Wu (Institute of Intelligent Machines,Chinese Academy of Sciences); Joya Chen
  (University of Science and Technology of China); zhou xu (Hefei Institutes of
  Physical Science,China Academy of Science); ZhiMing Yao (Hefei Institutes of
  Physical Science, Chinese Academy of Sciences); Xianjun Yang (Hefei
  Institutes of Physical Science, Chinese Academy of Sciences)* | 
 
  | 1186 | Efficient
  and Accurate Hypergraph Matching | Jian
  Hou (Dongguan University of Technology)*; Huaqiang Yuan (Dongguan University
  of Technology) | 
 
  | 801 | Zero-shot
  Multi-Focus Image Fusion | Xingyu
  Hu (Harbin Institute of Technology)*; Junjun Jiang (Harbin Institute of
  Technology); Xianming Liu (Harbin Institute of Technology); Jiayi Ma (Wuhan
  University) | 
 
  | 1187 | Attentive
  Update of Multi-Critic for Deep Reinforcement Learning | Qing
  Li (USTC)*; Wengang  Zhou
  (University of Science and Technology of China); Yun Zhou (University of
  Science and Technology of China); Houqiang Li (University of Science and
  Technology of China) | 
 
  | 1347 | Small
  object recognition using a spatio-temporal neural network | Zhibo
  Liang (Harbin Institute of Technology)*; Shaohui Liu (Harbin Institute of
  Technology); Wuzhen Shi (Shenzhen University); Xingtao Wang (Harbin Institute
  of Technology; Peng Cheng Laboratory); Feng Jiang (Harbin Institute of
  Technology, Harbin) | 
 
  | 1565 | Person
  Retrieval with Conv-Transformer | Shengsen
  Wu (Peking University)*; YAN BAI (Peking University); Ce Wang (Peking
  University); Lingyu Duan (Peking University) | 
 
  |  |  |  | 
 
  | O14 | Multimedia semantic segmentation | 
 
  | Time |  | 
 
  | Chair | Duc
  Thanh Nguyen (Deakin University) | 
 
  | ID | Title | Author | 
 
  | 642 | MULTI-SCALE
  FEEDBACK FEATURE REFINEMENT U-NET FOR MEDICAL IMAGE SEGMENTATION | Xiaofei
  Qin (University of Shanghai for Science and  Technology); Minmin Xu (University of
  Shanghai for Science and Technology); Chaoyang Zheng (University of Shanghai
  for Science and Technology); Changxiang He (University of Shanghai for
  Science and  Technology); Xuedian
  Zhang (University of Shanghai for Science and  Technology)* | 
 
  | 898 | Document
  Layout Analysis via Dynamic Residual Feature Fusion | Xingjiao
  Wu (East China Normal University); ZiLing Hu (East China Normal University);
  Xiangcheng Du (East China Normal University); Jing Yang (ECNU)*; Liang He
  (ECNU) | 
 
  | 1109 | SEMI-SUPERVISED
  SEMANTIC SEGMENTATION VIA ENTROPY MINIMIZATION | Jiawei
  Wu (Fujian Agriculture and Forestry University); Haoyi Fan (Harbin University
  of Science and Technology); Xiaoqing Zhang (Minjiang University); Shouying
  Lin (Fujian Agriculture and Forestry University); Zuoyong   Li (Minjiang University)* | 
 
  | 1184 | EFRNET:
  A LIGHTWEIGHT NETWORK WITH EFFICIENT FEATURE FUSION AND REFINEMENT FOR
  REAL-TIME SEMANTIC SEGMENTATION | Kuayue
  Zhang (Tsinghua University); Qingmin Liao (Tsinghua Univeristy); Juncheng
  Zhang (Tsinghua University); Shaojun Liu (Hong Kong University of Science and
  Technology)*; Haoyu Ma (Tsinghua University); Jing-Hao Xue (University
  College London) | 
 
  | 1205 | Weakly-Supervised
  Attribute Segmentation | Guangzhen
  Liu (Renmin University of China); Zhiwu Lu (Renmin University of China)* | 
 
  | 1518 | CONFIDENCE-GUIDED
  ADAPTIVE GATE AND DUAL DIFFERENTIAL ENHANCEMENT FOR VIDEO SALIENT OBJECT
  DETECTION | Pei-Jia
  Chen (Sun Yat-sen University); Jian-Huang Lai (Sun Yat-sen University)*;
  Guangcong Wang (Sun Yat-Sen University); Huajun Zhou (Sun Yat-sen University) | 
 
  |  |  |  | 
 
  | O15 | Image/Video Synthesis and Creation I | 
 
  | Time |  | 
 
  | Chair | Tsung-Wei
  Huang (Dolby Labs) | 
 
  | ID | Title | Author | 
 
  | 45 | Semantic-Aware
  Video Color Style Transfer based on Temporal Consistent Sparse Patch
  Constraint | Yaxin
  Liu (College of Computer Science and Software Engineering, Shenzhen
  University); Xiaoyan Zhang (College of Computer Science and Software
  Engineering, Shenzhen University)*; Xiaogang XU (The Chinese University of
  Hong Kong) | 
 
  | 119 | Learnable
  Sampling 3D Convolution for video enhancement and action recognition | Shuyang
  Gu (University of Science and Technology of China)*; Jianmin Bao (Microsoft
  Research Asia); Dong Chen (Microsoft Research Asia) | 
 
  | 137 | ASTM:
  An Attention based SpatioTemporal Model for Video Prediction Using 3D
  Convolutional Neural Networks | Zheng
  Chang (University of Chinese Academy of Sciences )*; xinfeng zhang
  (University of Chinese Academy of Sciences); Shanshe Wang (Peking
  University); Siwei Ma (Peking University, China); Yan Ye (Alibaba Inc.); Wen
  Gao (PKU) | 
 
  | 191 | Adversarial
  Adaptive Interpolation for Regularizing Representation Learning and Image
  Synthesis in AutoEncoders | Guanyue
  Li (SCUT); Xiwen Wei (South China University of Technology); Sheng Qian
  (Huawei Device Company Limited); Si Wu (South China University of
  Technology)*; Zhiwen Yu (South China University of Technology); Hau San Wong
  (City University of Hong Kong) | 
 
  | 220 | Real-time
  Masked Face Revealing for Video Conference | Jinpeng
  Lin (XiaMenUniversity); Pengfei Liu (School of Informatics, Xiamen
  University); Yinglin Zheng (School of Informatics, Xiamen University); Wenjin
  Deng (School of Informatics, Xiamen University); Ming Zeng (School of
  Informatics, Xiamen University)* | 
 
  | 245 | LI-NET:
  LARGE-POSE IDENTITY-PRESERVING FACE REENACTMENT NETWORK | Jin
  Liu (1. Institute of Information Engineering,Chinese Academy of Sciences. 2.
  School of Cyber Security, University of Chinese Academy of Sciences); Peng
  Chen (1. Institute of Information Engineering,Chinese Academy of Sciences. 2.
  School of Cyber Security, University of Chinese Academy of Sciences); Tao
  Liang (1. Institute of Information Engineering,Chinese Academy of Sciences.
  2. School of Cyber Security, University of Chinese Academy of Sciences);
  Zhaoxing Li (Institute of Information Engineering,Chinese Academy of
  Sciences); Cai Yu (1. Institute of Information Engineering,Chinese Academy of
  Sciences. 2. School of Cyber Security, University of Chinese Academy of
  Sciences); Shuqiao Zou (1. Institute of Information Engineering,Chinese
  Academy of Sciences. 2. School of Cyber Security, University of Chinese
  Academy of Sciences); Jiao Dai (Institute of Information Engineering,Chinese
  Academy of Sciences)*; Jizhong Han (Institute of Information
  Engineering,Chinese Academy of Sciences) | 
 
  |  |  |  | 
 
  | O16 | Object/Person
  detection, Tracking and Recognition I | 
 
  | Time |  | 
 
  | Chair | Chunjie
  Zhang (Beijing Jiaotong University) | 
 
  | ID | Title | Author | 
 
  | 72 | PMAE:
  PSEUDO MULTI-LABEL ATTENTION ENSEMBLE | Xueman
  Wang (Tiangong University); Ling Du (Tiangong University)*; Junbing Li
  (Tianjin University) | 
 
  | 102 | Improving
  Facial Attribute Recognition by Group and Graph Learning | Zhenghao
  Chen (University of Sydney)*; Shuhang Gu (ETH Zurich, Switzerland); Feng Zhu
  (Sensetime Group Limited); Jing Xu (Sensetime Group Limited); Rui Zhao
  (Sensetime Group Limited) | 
 
  | 144 | DSIC:
  Dynamic Sample-Individualized Connector for Multi-scale Object Detection | Zekun
  Li (Institute of automation, Chinese Academy of Sciences); Yufan Liu
  (Institute of Automation, Chinese Academy Sciences); Bing Li (National
  Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese
  Academy of Sciences)*; Weiming Hu (Institute of Automation,Chinese Academy of
  Sciences); Yanan Miao (CNCERT); Hong Zhang (CNCERT) | 
 
  | 368 | Object
  Decoupling with Graph Correlation for Fine-Grained Image Classification | Qiushi
  Guo (Alibaba Group)*; Mingchen Zhuge (China University of Geosciences);
  Dehong Gao (Alibaba Group); Huiling Zhou (Alibaba); Xin Wang (Alibaba Group);
  Xiaonan Meng (Alibaba Group) | 
 
  | 428 | Exploring
  Driving-aware Salient Object Detection via Knowledge Transfer | Jinming
  Su (Beihang University); Changqun Xia (Peng Cheng Laboratory)*; Jia Li
  (Beihang University) | 
 
  | 489 | Hands-on
  Guidance for Distilling Object Detectors | Yangyang
  Qin (Huazhong University of Science and Technology)*; Hefei Ling (Huazhong
  University of Science and Technology); Zhenghai He (Huazhong University of
  Science and Technology); Yuxuan Shi (Huazhong University of Science and
  Technology); Lei Wu (Huazhong University of Science and Technology) | 
 
  |  |  |  | 
 
  | O17 | Emerging
  multimedia applications of deep learning I | 
 
  | Time |  | 
 
  | Chair | Wei
  Qi Yan (Auckland University of Technology) | 
 
  | ID | Title | Author | 
 
  | 169 | Enhancing
  Adversarial Examples Via Self-Augmentation | Lifeng
  Huang (SunYat-sen university)*; Chengying Gao (Sun Yat-sen University );
  Wenzi Zhuang (Sun Yat-sen University); Ning Liu (Sun Yat-sen University ) | 
 
  | 178 | Unsupervised
  ensemble learning via network generation | Zhongfan
  Zhang (South China University of Technology); Wenming CAO (The University of
  Hong Kong)*; Cheng Liu (Shantou University); Rui Li (City University of Hong
  Kong); Qianfen Jiao (City University of Hong Kong); Zhiwen Yu (South China
  University of Technology); C. L. Philip Chen  (South China University of
  Technology); Hau San Wong (City University of Hong Kong) | 
 
  | 335 | Learning
  to transfer under unknown noisy environments: an universal weakly-supervised
  domain adaptation method | Xuan
  Liu (Hunan University); Ying Huang (Hunan University)*; Shichang He (Hunan
  University); Jiangjin Yin (Hunan University); Xinning Chen (Hunan
  University); Shigeng Zhang (Central South University) | 
 
  | 651 | Efficient
  training of lightweight neural networks using Online Self-Acquired Knowledge
  Distillation | Maria
  Tzelepi (Aristotle University of Thessaloniki)*; ANASTASIOS TEFAS (Aristotle
  University of Thessaloniki) | 
 
  | 763 | Flexible
  Knowledge Distillation with an Evolutional Network Population | Jie
  Lei (Zhejiang University Of Technology); Zhao Liu (Ping An Life Insurance Of
  China, Ltd.)*; Mingli Song (Zhejiang University); Juan Xu (Pingan Life
  Insurance of China); Jianping Shen (PingAn Life Insurance of China); Ronghua
  Liang (Zhejiang University of Technology) | 
 
  | 838 | Cooperative
  Learning for Noisy Supervision | Hao
  Wu (Cooperative Medianet Innovation Center, Shanghai Jiao Tong University)*;
  Jiangchao Yao (Damo Academy, Alibaba Group); Ya Zhang (Cooperative Medianet
  Innovation Center, Shang hai Jiao Tong University); Yan-Feng Wang
  (Cooperative medianet innovation center of Shanghai Jiao Tong University) | 
 
  |  |  |  | 
 
  | O18 | Multimedia security, privacy and
  forensic I | 
 
  | Time |  | 
 
  | Chair | Jun
  Wan (NLPR, CASIA) | 
 
  | ID | Title | Author | 
 
  | 645 | Multi-task
  Wavelet Corrected Network For Image Splicing Forgery Detection and
  Localization | Xiuli
  Bi (Chongqing University of Posts and Telecommunications); Zhang Zhipeng
  (Chongqing university of post and telecommunications); Liu Yanbin (Chongqing
  University of Posts and Telecommunications); bin xiao (Chongqing University
  of Posts and Telecommunications)*; Weisheng  Li (Chongqing University of Posts and
  Telecommunications) | 
 
  | 1454 | Multi-Modality
  Image Manipulation Detection | Chao
  Yang (Hunan University)*; Zhiyu Wang (Hunan University); Huawei Shen
  (Institute of Computing Technology, Chinese Academy of Sciences); Huizhou Li
  (Hunan University); Bin Jiang (Hunan University) | 
 
  | 200 | Video
  Abnormal Event Detection via Context Cueing Generative Adversarial Network | Zhi
  Zhang (Shenzhen University); Sheng-hua Zhong (Shenzhen University)*; Yan Liu
  (The Hong Kong Polytechnic University) | 
 
  | 247 | Leveraging
  Intra-domain Knowledge to Strengthen Cross-domain Crowd Counting | Yiqing
  Cai (East China Normal University); Lianggangxu Chen (East China Normal
  University); Zhenwei Ma (The Third Research Institute Of Ministry Of Public
  Security); Changhong lu (East China Normal University); Changbo Wang (East
  China Normal University); Gaoqi He (East China Normal University)* | 
 
  | 282 | DISCRIMINATIVE
  AND GEOMETRICALLY ROBUST ZERO-WATERMARKING SCHEME FOR PROTECTING DIBR 3D
  VIDEOS | Xiyao
  Liu (Central South University); Yayun Zhang (Central South University); Sibo
  Du (Central South University); Jian Zhang (Central South University)*;
  Ming  Jiang ( Guilin University of
  Electronic Technology); Hui Fang (Loughborough University) | 
 
  | 1037 | H-StegoNet:
  A Hybrid Deep Learning Framework for Robust Steganalysis | Soumik
  Mondal (A*STAR)*; Yeo  Sze
  Ling  (ASTAR-Institute for
  Infocomm Research, A*STAR); ArulMurugan Ambikapathi (ASTAR-Institute for
  Infocomm Research, A*STAR) | 
 
  |  |  |  | 
 
  | O19 | Special
  Session: Advanced Representation Learning for Robust Multimedia Image
  Understanding | 
 
  | Time |  | 
 
  | Chair | Guangwei
  Gao (Nanjing University of Posts and Telecommunications) | 
 
  | ID | Title | Author | 
 
  | 383 | Learning
  Homogeneous and Heterogeneous Co-Occurrences for Unsupervised Cross-modal
  Retrieval | Yang
  Zhao (Nanjing University of Science and Technology); Weiwei Wang (Nanjing
  University of Science and Technology); Haofeng Zhang (Nanjing University of
  Science and Technology)*; BingZhang Hu (Newcastle University) | 
 
  | 643 | Multimodal
  Transformer Networks with Latent Interaction for Audio-Visual Event
  Localization | Yixuan
  He (University of Electronic Science and Technology of China); Xing Xu
  (University of Electronic Science and Technology of China)*; Xin Liu (Huaqiao
  University); Weihua Ou (Guizhou Normal University); Huimin Lu (Kyushu
  Institute of Technology) | 
 
  | 921 | Disentangling
  Prototype and Variation for Single Sample Face Recognition | MENG
  PANG (Nanyang Technological University); Binghui Wang (Duke University); Mang
  YE (Wuhan University); Yiran Chen (Duke University); Bihan Wen (Nanyang
  Technological University)* | 
 
  | 1178 | Transferable
  Feature Learning on Graphs Across Visual Domains | Ronghang
  Zhu (University of Georgia)*; Xiaodong Jiang (Facebook Inc); Jiasen Lu (Allen
  Institute for AI); Sheng Li (University of Georgia) | 
 
  | 1452 | Face
  Super-Resolution through Dual-identity Constraint | Fangfang
  Cheng (Wuhan Institute of Technology)*; Tao Lu (Wuhan Institute of
  Technology); Yu Wang (Wuhan Institute of technology); Yanduo Zhang (Wuhan
  Institute of Technology) | 
 
  |  |  |  | 
 
  |  |  |  | 
 
  | O20 | Multimedia
  Applications I | 
 
  | Time |  | 
 
  | Chair | Yongshan
  Zhang (University of Macau) | 
 
  | ID | Title | Author | 
 
  | 908 | DGD-NET:
  LOCAL DESCRIPTOR GUIDED KEYPOINT DETECTION NETWORK | Xiaotao
  Liu (Tianjin University); Chen Meng (College of Intelligence and Computing,
  Tianjin University, China); Fei-Peng 
  Tian (Tianjin University); Wei Feng (College of Intelligence and
  Computing, Tianjin University, China)* | 
 
  | 1335 | Multi-view
  Tensor Clustering through Exploiting both Within-view and Across-view
  High-order Correlations | haiyan
  wang (South China University of Technology); Guoqiang Han (South China
  University of Technology); Yu Hu (South China University of Technology); Hong
  Peng (South China University of Technology); Jiazhou Chen (South China
  University of Technology); Bin Zhang (South China University of Technology);
  Hongmin Cai (South China University of Technology)* | 
 
  | 1484 | Path
  Ranking Model For Entity Prediction | xiao
  long (USTC); MingHong Yao (University of Science and Technology of China);
  Liansheng Zhuang (University of Science and Technology of China)*; Houqiang
  Li (University of Science and Technology of China) | 
 
  | 1057 | Learning
  efficient rotation representation for point cloud via local-global
  aggregation | Ruibin
  Gu (South China University of Technology); Qiuxia Wu (South China University
  of Technology, China)*; Hongbin Xu (South China University of Technology);
  Wing W.Y. Ng (South China University of Technology); Zhiyong Wang (The
  University of Sydney) | 
 
  | 371 | Model
  Compression via Collaborative Data-free Knowledge Distillation for Edge
  Intelligence | Zhiwei
  Hao (Beijing Institute of Technology)*; Yong Luo (Wuhan University); Zhi Wang
  (Tsinghua University); Han Hu (Beijing Institute of Technology, China);
  Jianping An (Beijing Institute of Technology) | 
 
  |  |  |  | 
 
  |  |  |  | 
 
  | O21 | Object/Person
  detection, Tracking and Recognition II | 
 
  | Time |  |  | 
 
  | Chair | Yu
  Zhou (Institute of Information Engineering, CAS) | 
 
  | ID | Title | Author | 
 
  | 547 | Multi-view
  Face Recognition using Deep Attention-based Face Frontalization | Xiao-Hu
  Shao (Chongqing Institute of Green and Intelligent Technology,Chinese Academy
  of Sciences; University of Chinese Academy of Sciences)*; Junliang Xing
  (Institute of Automation, Chinese Academy of Sciences); Ruihan Pan (Chongqing
  Institute of Green and Intelligent Technology, Chinese Academy of Sciences);
  Zhenghao Li (Chongqing Institute of Green and Intelligent Technology, Chinese
  Academy of Sciences); Xiang-Dong Zhou (Chongqing Institute of Green and
  Intelligent Technology,Chinese Academy of Sciences); Yu Shi (Chongqing
  Institute of Green and Intelligent Technology,Chinese Academy of Sciences) | 
 
  | 899 | CORE-Text:
  Improving Scene Text Detection with Contrastive Relational Reasoning | Jingyang
  Lin (Sun Yat-Sen University); Yingwei Pan (JD AI Research)*; Rongfeng Lai (JD
  AI Research); Xuehang Yang (JD AI Research); Hongyang Chao (Sun Yat-sen
  University); Ting Yao (JD AI Research) | 
 
  | 923 | SSDL:
  Self-Supervised Dictionary Learning | Shuai
  Shao (China University of Petroleum (East China) College of Control Science
  and Engineering); Lei Xing (China University of Petroleum(East China) College
  of Oceanography and Space Informatics); wei yu (Harbin Institute of
  Technology, School of computer science and technology); Rui Xu (China
  University of Petroleum (East China) College of Control Science and
  Engineering); yanjiang wang (China University of Petroleum (East China)
  College  of Control Science and
  Engineering); baodi liu (China University of Petroleum (East China) College
  of Information and Control Engineering)* | 
 
  | 974 | DeepMix:
  Online Auto Data Augmentation for Robust Visual Object Tracking | Ziyi
  Cheng (Kyushu University); Xuhong Ren (School of Computer Science and
  Engineering, Tianjin University of Technology); Felix Juefei-Xu (Alibaba
  Group, USA); Wanli Xue (Tianjin University of Technology)*; Qing Guo (Nanyang
  Technological University); Lei Ma (University of Alberta); Jianjun Zhao
  (Kyushu University) | 
 
  | 1006 | MATTING
  ENHANCED MASK R-CNN | Lufan
  Ma (Tsinghua University)*; Bin Dong (Southeast University); Jiangpeng Yan
  (Tsinghua University); Xiu Li (Tsinghua University) | 
 
  | 1036 | DEEP
  CORRELATION FILTERS FOR ROBUST VISUAL TRACKING | Xiang
  Liu (Dongguan University  of  Technology)* | 
 
  |  |  |  | 
 
  | O22 | Image/Video Synthesis and Creation II | 
 
  | Time |  | 
 
  | Chair | Ming-Ching
  Chang (University at Albany - SUNY) | 
 
  | ID | Title | Author | 
 
  | 403 | STAE:
  A SpatioTemporal Auto-Encoder for High-Resolution Video Prediction | Zheng
  Chang (University of Chinese Academy of Sciences )*; xinfeng zhang
  (University of Chinese Academy of Sciences); Shanshe Wang (Peking
  University); Siwei Ma (Peking University, China); Yan Ye (Alibaba Inc.); Wen
  Gao (PKU) | 
 
  | 439 | FEW-SHOT
  KNOWLEDGE TRANSFER FOR FINE-GRAINED CARTOON FACE GENERATION | Nan
  Zhuang (Peking University)*; Cheng Yang (ByteDance Inc.) | 
 
  | 817 | BargainNet:
  Background-Guided Domain Translation for Image Harmonization | Wenyan
  Cong (Shanghai Jiao Tong University); Li Niu (Shanghai Jiao Tong
  University)*; Jianfu Zhang (RIKEN AIP;Shanghai Jiao Tong University); Jing
  Liang (Shanghai Jiao Tong University); Liqing Zhang (Shanghai Jiao Tong
  University) | 
 
  | 1160 | DNA-NET:
  AGE AND GENDER AWARE KIN FACE SYNTHESIZER | Pengyu
  Gao (Southeast University); Joseph P Robinson (Northeastern University);
  Jiaxuan Zhu (Southeast University); Chao Xia (Shanghai Jiao Tong University);
  Ming Shao (University of Massachusetts Dartmouth); Siyu Xia (Southeast
  University, China)* | 
 
  | 1163 | Spatial
  Content Alignment For Pose Transfer | Wing
  Yin Yu (CITY UNIVERSITY OF HONG KONG)*; Lai-Man Po (CITY UNIVERSITY OF HONG
  KONG); Yuzhi Zhao (City University of Hong Kong); Jingjing Xiong (CITY
  UNIVERSITY OF HONG KONG); Kin Wai Lau (CITYU UNIVERSITY OF HONG KONG) | 
 
  | 1339 | INFRARED
  AND VISIBLE IMAGE FUSION BASED ON MODAL FEATURE FUSION NETWORK AND DUAL
  VISUAL DECISION | Yong
  Yang (School of Information Technology, Jiangxi University of Finance and
  Economics); Jiaxiang Liu (School of Information Technology, Jiangxi
  University of Finance and Economics)*; Shuying Huang (School of Software and
  Communication Engineering, Jiangxi University of Finance and Economics);
  Weiguo Wan (School of Software and Communication Engineering, Jiangxi
  University of Finance and Economics); Xiangkai Kong (School of Information
  Technology, Jiangxi University of Finance and Economics); Wang Zhang (	School
  of Information Technology, Jiangxi University of Finance and Economics) | 
 
  |  |  |  | 
 
  | O23 | Multimedia analysis and understanding I | 
 
  | Time |  | 
 
  | Chair | Bingpeng
  Ma (University of Chinese Academy of Sciences) | 
 
  | ID | Title | Author | 
 
  | 937 | Cross-scene
  Person Trajectory Anomaly Detection Based on Re-Identification | Yuanxun
  Li (Sun Yat-sen University, China); Ancong Wu (Sun Yat-sen University);
  WEI-SHI ZHENG (Sun Yat-sen University, China)* | 
 
  | 1073 | ACTION
  PREDICTION NETWORK WITH AUXILIARY OBSERVATION RATIO REGRESSION | Cuiwei
  Liu (Shenyang Aerospace University)*; Yiming Gao (Shenyang Aerospace
  University); Zhaokui Li (Shenyang Aerospace University); Chong Du (Shenyang
  Aircraft Design and Research Institute); Fang Liu (Shenyang Aerospace
  University;Northeastern University); Xiangbin Shi (Shenyang Aerospace
  University) | 
 
  | 1082 | GAIT
  IDENTIFICATION BASED ON HUMAN SKELETON WITH PAIRWISE GRAPH CONVOLUTIONAL
  NETWORK | Ke
  Xu (Shanghai Jiao Tong University)*; Xinghao Jiang (Shanghai Jiao Tong
  University); Tanfeng Sun (Shanghai Jiao Tong University) | 
 
  | 1119 | spatial
  reasoning and context-aware attention network for skeleton-based action
  recognition | Dianlong
  You (yanshan university); Ling Wang (yanshan university)*; Da Han (Cardiff
  University); Shunpan Liang (yanshan university); Hongyang Liu (yanshan
  university); Fuyong Yuan (yanshan university) | 
 
  | 1525 | Edge
  Enhancement Network for Weakly Supervised Semantic Segmentation | Mei
  Yu (Tianjin University); Junbin Wei (Tianjin University); Chenhan Wang
  (	Laboratory of OpenBayes Machine Intelligence Lab); Han Jiang (Laboratory of
  OpenBayes Machine Intelligence Lab); Jian Yu (Tianjin University); Ruixuan
  Zhang (College of Intelligence and Computing, Tianjin University); Xuewei Li
  (Tianjin University)*; Ruiguo Yu (Tianjin University) | 
 
  | 1587 | Associative
  Segmentation for Instances and Semantics by perceiving neighborhood in Point
  Clouds | Yingying
  Zhu (Shenzhen University); Biao Li (Shenzhen University); Qiang Huang
  (Shenzhen University)* | 
 
  |  |  |  | 
 
  | O24 | Multimedia
  interaction & Multimedia quality assessment | 
 
  | Time |  | 
 
  | Chair | Jong-Seok
  LEE (Yonsei University) | 
 
  | ID | Title | Author | 
 
  | 959 | FINE-GRAINED
  DISCOURSE FOR METAPHOR DETECTION | qimeng
  yang (xinjiang university)*; Long Yu (Xinjiang University); Shengwei Tian
  (Xinjiang University); jinmiao song (Xinjiang University) | 
 
  | 1128 | Facial
  Chirality: Using self-face reflection to learn discriminative features for
  facial expression recognition | Ling
  Lo (	National Chiao Tung University); Hong Xia Xie (National Chiao Tung
  University); Hong-Han Shuai (National Chiao Tung University); Wen-Huang Cheng
  (National Chiao Tung University)* | 
 
  | 106 | SKANET:
  STRUCTURED KNOWLEDGE-AWARE NETWORK FOR VISUAL DIALOG | Lei
  Zhao (The University of Electronic Science and Technology of China); Lianli
  Gao (The University of Electronic Science and Technology of China)*;
  Yuyu Guo (UESTC); Jingkuan Song (UESTC); Heng Tao Shen (University of
  Electronic Science and Technology of China (UESTC)) | 
 
  | 755 | A
  No-reference Evaluation Metric for Low-light Image Enhancement | Zicheng
  Zhang (Shanghai Jiaotong university)*; Wei Sun (Shanghai Jiao Tong
  Unviersity); Xiongkuo Min (Shanghai Jiao Tong University); Wenhan Zhu
  (Shanghai Jiao Tong University); Tao Wang (ShanghaiJiaotongUniversity); Wei
  Lu (Shanghai Jiao Tong University); Guangtao Zhai (Shanghai Jiao Tong
  University) | 
 
  | 1158 | DEEP
  NEURAL NETWORKS FOR END-TO-END SPATIOTEMPORAL VIDEO QUALITY PREDICTION AND
  AGGREGATION | Junming
  Chen (Peking University); Haiqiang Wang (Pengcheng Laboratory); Munan Xu
  (Shenzhen Graduate School, Peking University); Ge Li (SECE, Shenzhen Graduate
  School, Peking University)*; Shan Liu (Tencent America) | 
 
  | 1465 | No-Reference
  Deep Quality Assessment of Compressed Light Field Images | Zixuan
  Guo (Peking University); Wei Gao (Peking University & Peng Cheng
  Laboratory)*; Haiqiang Wang (Pengcheng Laboratory); Junle Wang (Tencent);
  Songlin Fan (Peking University ) | 
 
  |  |  |  | 
 
  | O25 | Multimedia security, privacy and
  forensic II | 
 
  | Time |  | 
 
  | Chair | Liang
  He (Tsinghua University) | 
 
  | ID | Title | Author | 
 
  | 18 | Blind
  Adversarial Pruning: Towards the Comprehensive Robust Models with Gradually
  Pruning Against Blind Adversarial Attacks | Haidong
  Xie (Qian Xuesen Laboratory, China Academy of Space Technology); Lixin Qian
  (	Wuhan University of Technology); Xueshuang Xiang (Qian Xuesen Laboratory of
  Space Technology)*; Naijin Liu (Qian Xuesen Laboratory, China Academy of
  Space Technology) | 
 
  | 695 | EFFICIENT
  OPEN-SET ADVERSARIAL ATTACKS ON DEEP FACE RECOGNITION | Haojie
  Yuan (University of Science and Technology of China); Qi Chu (University of
  Science and Technology of China)*; Feng Zhu (University of Science and
  Technology of China); Rui Zhao (SenseTime Group Limited); Bin Liu (University
  of Science and Technology of China); Nenghai Yu (University of Science and
  Technology of China) | 
 
  | 920 | CONTENT-INDEPENDENT
  ONLINE HANDWRITING VERIFICATION BASED ON MULTI-MODAL FUSION | Nan
  Ji (School of Cyberspace Security, University of Science and Technology of
  China); Bin Liu (University of Science and Technology of China)*; Zhiwei Zhao
  (University of Science and Technology of China); Yan Lu (University of
  Sydney); Qi Chu (University of Science and Technology of China); Zhenchao Jin
  (University of Science and Technology of China); Nenghai Yu (University of
  Science and Technology of China) | 
 
  | 1324 | On
  Generating JPEG Adversarial Images | Mengte
  Shi (Fudan University); Sheng Li (Fudan University); Zhaoxia Yin (Anhui
  University); Xinpeng Zhang (School of Computer Science, Fudan University)*;
  Zhenxing Qian (School of Computer Science, Fudan University) | 
 
  | 1459 | Transferable
  Adversarial Examples for Anchor Free Object Detection | quanyu
  liao (Chengdu University of Information Technology); Xin Wang (Keya Medical);
  bin kong (curacloud); Siwei Lyu (University at Buffalo); Bin Zhu (Microsoft
  Research Asia); youbing yin (Curacloud); qi  song (Curacloud); Xi Wu (Chengdu
  University of Information Technology)* | 
 
  |  |  |  | 
 
  |  |  |  | 
 
  | O26 | Special
  Session: Recent Advance in Depth-Related Processing and Applications | 
 
  | Time |  | 
 
  | Chair | Runmin
  Cong (Beijing Jiaotong University) | 
 
  | ID | Title | Author | 
 
  | 105 | SN-Graph:
  a Minimalist 3D Object Representation for Classification | Siyu
  Zhang (Donghua University); Hui Cao (Donghua University); Yuqi Liu (Donghua
  University); Shen Cai (Donghua University)*; Yanting Zhang (Donghua
  University); Yuanzhan Li (Donghua University); Xiaoyu Chi (Goertek Co., Ltd) | 
 
  | 259 | Stereo
  Superpixel Segmentation via Dual-attention Fusion Networks | Ruiqi
  Wu (Wuhan University of Technology); Yajuan Du (Wuhan University of
  Technology); Hua Li (Huazhong University of Science and Technology; City
  University of Hong Kong)*; Yucong Dai (Wuhan University of Technology) | 
 
  | 278 | IRS:
  A Large Naturalistic Indoor Robotics Stereo Dataset to Train Deep Models for
  Disparity and Surface Normal Estimation | Qiang
  Wang (Hong Kong Baptist University)*; Shizhen Zheng (HKBU); Qingsong Yan
  (Wuhan University); Fei Deng (Wuhan University); Kaiyong Zhao (Hong Kong
  Baptist University); Xiaowen Chu (Hong Kong Baptist University) | 
 
  | 1162 | QoE-based
  Neural Live Streaming Method With Continuous Dynamic Adaptive Video Quality
  Control | Xuekai
  WEI (City University of Hong Kong); Mingliang Zhou (Chongqing University)*;
  Sam Kwong (City Univeristy of Hong Kong); Hui Yuan (Shandong University); Tao
  Xiang (Chongqing University) | 
 
  | 1345 | DUAL
  REGULARIZATION BASED DEPTH MAP SUPER-RESOLUTION WITH GRAPH LAPLACIAN PRIOR | Longhua
  Sun (Beijing University of Technology); Jin Wang (Beijing University of
  Technology)*; Ruiqin Xiong (Peking University); Yunhui Shi (Beijing
  University of Technology); Qing Zhu (Beijing University of Technology);
  Baocai  Yin (Beijing University of
  Technology) | 
 
  |  |  |  | 
 
  |  |  |  | 
 
  | O27 | Image/Video
  Enhancement III | 
 
  | Time |  | 
 
  | Chair | Chau-Wai
  Wong (North Carolina State University) | 
 
  | ID | Title | Author | 
 
  | 1174 | Image
  demoireing with a dual-domain distilling network | Hailing
  Wang (Tianjin University); Qiaoyu Tian (Tianjin University); Liang Li
  (Tianjin University)*; Xiaojie Guo (Tianjin University) | 
 
  | 1176 | Contrastive
  Feature Decomposition for Image Reflection Removal | Xin
  Feng (Harbin Institute of Technology, Shenzhen); Haobo Ji (Harbin Institute
  of Technology,Shenzhen); Bo Jiang (Harbin Institute of Technology Shenzhen);
  Wenjie Pei (Harbin Institute of Technology, Shenzhen); Fanglin Chen (	Harbin
  Institute of Technology, Shenzhen); Guangming Lu ( Harbin Institute of
  Technology, Shenzhen)* | 
 
  | 1189 | RGB
  GUIDED DEPTH MAP SUPER-RESOLUTION WITH COUPLED U-NET | Yingjie
  Cui (Tsinghua University); Qingmin Liao (Tsinghua Univeristy)*; Wenming Yang
  (Tsinghua University); Jing-Hao Xue (University College London) | 
 
  | 1374 | Blur
  Invariant Kernel-Adaptive Network for Single Image Blind Deblurring | Sungkwon
  An (Seoul National University	); Hyungmin Roh (Seoul National University);
  Myungjoo Kang (Seoul National University)* | 
 
  | 1373 | STRUCTURAL
  PRIOR GUIDED IMAGE INPAINTING FOR COMPLEX SCENE | Shuxin
  Wei (Sun Yat-sen University); Chengying Gao (Sun Yat-sen University )* | 
 
  | 1517 | BWIN:
  A Bilateral Warping Method for Video Frame Interpolation | Fanyong
  Xue (Shanghai Jiao Tong University); Jie Li (Shanghai Jiao Tong University)*;
  Jiannan Liu (Shanghai Jiao Tong University); Chentao Wu (Shanghai Jiao Tong
  University) | 
 
  |  |  |  | 
 
  | O28 | Multimedia analysis and understanding
  II | 
 
  | Time |  | 
 
  | Chair | Liping
  Chen (Microsoft) | 
 
  | ID | Title | Author | 
 
  | 953 | A
  lightweight Saliency Prediction Model for Omnidirectional Images | dandan
  zhu (Shanghai Jiao Tong University)*; yongqing chen ( Hainan Air Traffic
  Management Sub-Bureau); Defang Zhao (Tongji University); Xiongkuo Min
  (Shanghai Jiao Tong University); Qiangqiang Zhou (Jiangxi Normal University);
  Shaobo Yu (East China Normal University); Guangtao Zhai (Shanghai Jiao Tong
  University); Xiaokang Yang (Shanghai Jiao Tong University) | 
 
  | 1277 | Multi-Scale
  Attention Constraint Network for Fine-Grained Visual Classification | Yaqing
  Hou (Dalian University of Technology)*; zhang wenkai (Dalian University of
  Technology); dongsheng zhou (dlu.edu.cn); Hongwei Ge (Dalian University of
  Technology); Qiang Zhang (Dalian University of Technology); Xiaopeng Wei
  (Dalian University of Technology) | 
 
  | 1314 | Multiple
  Hub-driven Attention Graph Network for Scene Graph Generation | Yang
  Yao (Sun Yat-sen University)*; Bo Gu (Sun Yat-sen University) | 
 
  | 1105 | HRDNet:
  High-resolution Detection Network for Small Objects | Ziming
  Liu (Inria); Guangyu Ryan Gao (Beijing Institute of Technology)*; Lin Sun
  (Samsung, USA); zhiyuan fang (Beijing Institute of Technology	) | 
 
  | 1157 | Meta-Graph
  Adaptation for Visual Object Tracking | Qiangqiang
  Wu (City University of Hong Kong); Antoni Chan (City University of Hong Kong,
  Hong, Kong)* | 
 
  | 1288 | CUTMIX
  DUAL BRANCH NETWORK FOR PERSON RE-IDENTIFICATION | Zengming
  Tang (Shanghai Advanced Research Institute, Chinese Academy of Sciences,
  Shanghai, China)*; Jun Huang (Shanghai Advanced Research Institute, Chinese
  Academy of Sciences) | 
 
  |  |  |  | 
 
  | O29 | Emerging
  multimedia applications of deep learning II | 
 
  | Time |  |  | 
 
  | Chair | Maggie
  Zhu (Purdue University) | 
 
  | ID | Title | Author | 
 
  | 895 | DEEP
  TIERED IMAGE SEGMENTATION FOR DETECTING INTERNAL ICE LAYERS IN RADAR IMAGERY | Yuchen
  Wang (Indiana University)*; Mingze Xu (Amazon); John Paden (University of
  Kansas); Lora Koenig (Univeristy of Colorado); Geoffrey  Charles Fox (Indiana University);
  David Crandall (Indiana University) | 
 
  | 947 | ATTENTION
  DRIVEN SELF-SIMILARITY CAPTURE FOR MOTION DEBLURRING | Jie
  Zhang (School of Computer Science, Fudan University); Chuanfa Zhang (Fudan
  University); Jiangzhou Wang (School of Computer Science, Fudan University);
  Qingyue Xiong (Fudan University); Yingtao Zhang (School of Computer Science,
  Fudan University); Wenqiang Zhang (Fudan University)* | 
 
  | 986 | Wide-sense
  Stationary Policy Optimization with Bellman Residual on Video Games | Chen
  Gong (Institute of Automation, Chinese Academy of Sciences)*; Qiang He
  (Institute of Automation, Chinese Academy of Sciences); Yunpeng Bai
  (Institute of Automation, Chinese Academy of Sciences); Xinwen Hou (Institute
  of Automation, Chinese Academy of Sciences); Guoliang Fan (Institute of
  Automation, Chinese Academy of Sciences); Yu Liu (Institute of Automation,
  Chinese Academy of Sciences) | 
 
  | 1076 | ACSNet:
  Adaptive Cross-scale Network with Feature Maps Refusion for Vehicle Density
  Detection | Zuhao
  Ge (Shantou University); Yuhui Li (Shantou University); Cheng Liang (Shantou
  University); Youyi Song (The Hong Kong Polytechnic University); Teng Zhou
  (Shantou University)*; Jing Qin (The Hong Kong Polytechnic University) | 
 
  | 1150 | Unsupervised
  Domain Adaptation via Cluster Alignment with Maximum Classifier Discrepancy | Mohamed
  Azzam (City University of Hong Kong); Si Wu (South China University of
  Technology); Aurele Tohokantche Gnanha 
  (City University of Hong Kong); Qianfen Jiao (City University of Hong
  Kong); Hau San Wong (City University of Hong Kong)* | 
 
  | 1495 | LIDAR-BASED
  REAL-TIME MAPPING FOR DIGITAL TWIN DEVELOPMENT | Evan
  Brock (University of Tennessee at Chattanooga); Chengxuan Huang (University
  of California, Davis); Dalei Wu (University of Tennessee at Chattanooga)*; Yu
  Liang (University of Tennessee at Chattanooga) | 
 
  |  |  |  | 
 
  | O30 | Multimedia
  Applications II | 
 
  | Time |  | 
 
  | Chair | Xinggong
  Zhang (Peking University) | 
 
  | ID | Title | Author | 
 
  | 204 | CoConv:
  Learning Dynamic Cooperative Convolution for Image Recognition | Kien
  X Nguyen (Texas Christian University); Tiffany Ryu (University of North
  Texas); Jocelyn Zhang (University of North Texas); Xu Ma (University of North
  Texas)*; Qing Yang (University of North Texas); Song Fu (University of North
  Texas); Paparao Palacharla (Fujitsu Network Communications); Nannan Wang
  (Fujitsu Network Communications); Xi Wang (Fujitsu Network Communications) | 
 
  | 1192 | DRL-based
  Collaborative Edge Content Replication with Popularity Distillation | Haopeng
  Yan (Tsinghua University); Zeming Chen (Tsinghua University); Zhi Wang
  (Tsinghua University); Wenwu Zhu (Tsinghua University)* | 
 
  | 656 | Handwriting
  Trajectory Recovery from Off-Line Multi-Stroke Characters by Deep Ordering
  Prediction and Heuristic Search | Tie-Qiang
  Wang (CASIA)*; Cheng-Lin Liu (Institute of Automation of Chinese Academy of
  Sciences) | 
 
  | 239 | CNN-Based
  Depth Map Prediction for Fast Block Partitioning in HEVC Intra Coding | Aolin
  Feng (University of Science and Technology of China); Changsheng Gao
  (University of Science and Technology of China); Li Li (University of Science
  and Technology of China); Dong Liu (University of Science and Technology of
  China)*; Feng Wu (University of Science and Technology of China) | 
 
  | 1463 | A
  REAL-TIME H.266/VVC SOFTWARE DECODER | Bin
  Zhu (Tencent America); Shan Liu (Tencent America); Yuan Liu (Tencent
  America); Yi Luo (Tencent America); Jing Ye (Tencent America); Haiyan Xu
  (Tencent America); Ying Huang (Tencent America); Hualong Jiao (Tencent
  America); Xiaozhong Xu (Tencent America)*; Xianguo Zhang (Tencent); Chenchen
  Gu (Tencent) | 
 
  | 16 | On
  Forecasting Dynamics in Online Discussion Forums | Chen
  Ling (University of Delaware); Di Cui (University of Delaware); Guangmo Tong
  (University of Delaware)*; Jianming ZHU (University of Chinese Academy of
  Sciences) | 
 
  |  |  |  | 
 
  | O31 | Special
  Session: Multimedia Knowledge-Driven Deep Analysis and Forensics/Security | 
 
  | Time |  | 
 
  | Chair | Chang-Tsun
  Li (Deakin University) | 
 
  | ID | Title | Author | 
 
  | 209 | Robust
  Image Denoising with Texture-Aware Neural Network | Bo
  Fu (Dalian University of Technology)*; Liyan Wang (Liaoning Normal
  University); Zhongxuan Luo (DALIAN UNIVERSITY OF TECHNOLOGY) | 
 
  | 555 | Multi-Graph
  Based Hierarchical Semantic Fusion for Cross-Modal Representation | Lei
  Zhu ()*; Chengyuan Zhang (Hunan University); Jiayu Song (Central South
  University); Liangcheng Liu (UniversityofMelbourne); Shichao Zhang ();
  Yangding Li (Hunan Normal University) | 
 
  | 96 | ON
  CONSTRUCTING A BETTER CORRELATION PREDICTOR FOR PRNU-BASED IMAGE FORGERY
  LOCALIZATION | Xufeng
  Lin (Deakin University)*; Chang-Tsun Li (Deakin University, Australia) | 
 
  | 337 | VideoForensicsHQ:
  Detecting High-quality Manipulated Face Videos | Gereon
  Fox (Max Planck Institute for Informatics)*; Wentao Liu (Max Planck Institute
  for Informatics); Hyeongwoo Kim (Max Planck Institute for Informatics);
  Hans-Peter Seidel (Max Planck Institute for Informatics); Mohamed Elgharib
  (Max Planck Institute for Informatics); Christian Theobalt (MPI Informatik) | 
 
  | 1185 | DEFAKEHOP:
  A LIGHT-WEIGHT HIGH-PERFORMANCE DEEPFAKE DETECTOR | Hong-Shuo
  Chen (USC)*; Mozhdeh Rouhsedaghat (University of Southern California); Hamza
  H Ghani (USC); Shuowen Hu (US Army Research Laboratory); Suya You (U.S. Army
  Research Laboratory); C.-C. Jay Kuo (USC) | 
 
  |  |  |  | 
 
  |  |  |  | 
 
  | O32 | Industry
  and Application Track I | 
 
  | Time |  | 
 
  | Chair | Zhang
  Wei (Singapore Institute of Technology) | 
 
  | ID | Title | Author | 
 
  | W11 | MT-GAN:
  A Training Framework to Enhance Image Classification Task with Image
  Translation | Qun
  Li (Microsoft)*; Changbo Hu (Microsoft); Keng-hao Chang (Microsoft); Ruofei
  Zhang (Microsoft) | 
 
  | W90 | A
  time-variant QoE model based on real video streaming data | Shengbin
  Meng (ByteDance Inc.)*; Minyin Zeng (ByteDance Inc.); Junlin Li (ByteDance
  Inc.); Yue Wang (Beijing ByteDance Technology Co., Ltd.); Zongming Guo
  (Peking University) | 
 
  | W111 | Hardware-aware
  Model Optimization Tool For Embedded Devices | Cagri
  Ozcinar (Samsung)*; Dongsun Kim (Samsung R&D Institute UK); Ben Rufus
  Duckworth (Samsung R&D Institute UK); Shayan Joya (Samsung R&D
  Institute UK); Nicolas  Scotto Di
  Perto (Samsung R&D Institute UK); Attila Dusnoki (University of Szeged);
  Márkó Fabó (University of Szeged	); Dániel Vince (University of Szeged);
  Gábor Lóki (University of Szeged	); Ákos Kiss (University of Szeged);
  Christopher Alder (Samsung R&D Institute UK) | 
 
  | W117 | MULTI-MODAL
  FUSION ENHANCED MODEL FOR DRIVER’S FACIAL EXPRESSION RECOGNITION | Jianrong
  Chen (University of California, San Diego)*; Sujit Dey (University of
  California, San Diego); Lei Wang (Qualcomm); Ning Bi (Qualcomm); Peng Liu
  (Qualcomm) | 
 
  |  |  |  | 
 
  |  |  |  | 
 
  |  |  |  | 
 
  | O33 | Image/video
  acquisition and compression | 
 
  | Time |  | 
 
  | Chair | Xin
  Zhao (Tencent) | 
 
  | ID | Title | Author | 
 
  | 698 | Thousand
  to One: Semantic Prior Modeling for Conceptual Coding | Jianhui
  Chang (Peking University)*; Zhenghui Zhao (Peking University); Lingbo Yang
  (Peking University); Chuanmin Jia (Peking University); Jian Zhang (Peking
  University Shenzhen Graduate School); Siwei Ma (Peking University, China) | 
 
  | 1063 | Spatial-Temporal
  Synergic Prior Driven Unfolding Network for Snapshot Compressive Imaging | Zhuoyuan
  Wu (PKU)*; Zhenyu Zhang (PKU); Jiechong Song (PKU); Jian Zhang (Peking
  University Shenzhen Graduate School) | 
 
  | 1068 | EFFICIENT
  VIDEO COMPRESSED SENSING RECONSTRUCTION VIA EXPLOITING SPATIAL-TEMPORAL
  CORRELATION WITH MEASUREMENT CONSTRAINT | Zhichao
  Wei (South China University of Technology)*; Chunling Yang (South China
  University of Technology	); Yunyi Xuan (South China University of Technology) | 
 
  | 1369 | Enhanced
  Implicit Selection of Transform Skip in AVS3 | liqiang
  wang (Tencent)*; Xiaozhong Xu (Tencent America); Shan Liu (Tencent America) | 
 
  | 205 | VANet:
  A View Attention Guided Network for 	3D Reconstruction from Single and
  Multi-view Images | Yi
  Yuan (NetEase Fuxi AI Lab)*; Jilin Tang (NetEase Fuxi AI Lab); Zhengxia Zou
  (University of Michigan) | 
 
  | 226 | DIFFERENTIABLE
  LIGHT-WEIGHT ARCHITECTURE SEARCH | Yuxu
  Mao (Ocean University of China); Guoqiang Zhong (Ocean University of China)*;
  Yanan Wang (Ocean University of China); Zhaoyang Deng (Ocean University of
  China) | 
 
  |  |  |  | 
 
  | O34 | Multimedia
  analysis and understanding III | 
 
  | Time |  | 
 
  | Chair | Ming-Ching
  Chang (University at Albany - SUNY) | 
 
  | ID | Title | Author | 
 
  | 1399 | MPN:
  Multimodal Parallel Network for Audio-Visual Event Localization | Jiashuo
  Yu (Fudan University)*; Ying Cheng (Fudan University); Rui Feng (Fudan
  University) | 
 
  | 1488 | Learning
  Content and Context with Language Bias for Visual Question Answering | Chao
  Yang (Hunan University)*; Su Feng (Hunan University); Dongsheng Li (Microsoft
  Research Asia); Huawei Shen (Institute of Computing Technology, Chinese
  Academy of Sciences); Guoqing Wang (Hunan University); Bin Jiang (Hunan
  University) | 
 
  | 627 | Efficient
  Human Pose Estimation by Learning Deeply Aggregated Representations | Zhengxiong
  Luo (Institute of Automation,Chinese Academy of Sciences)*; Zhicheng Wang
  (Megvii); Yuanhao Cai (Tsinghua Univisity, Tsinghua Shenzhen International
  Graduate School); Guan'an Wang (CASIA); Liang Wang (NLPR, China); Yan Huang
  (Institute of Automation, Chinese Academy of Sciences); Erjin Zhou (Megvii
  Research); Tieniu Tan (NLPR, China); Jian Sun (Megvii Technology) | 
 
  | 1180 | An
  Efficient Approach for Audio-Visual Emotion Recognition with Missing Labels
  and Missing Modalities | Fei
  Ma (Tsinghua-Berkeley Shenzhen Institute, Tsinghua University)*; Shao-Lun
  Huang (TBSI); Lin Zhang (Tsinghua University, China) | 
 
  | 1478 | ConSK-GCN:
  Conversational Semantic- and Knowledge-oriented  Graph Convolutional Network for
  Multimodal Emotion Recognition | Yahui
  Fu (Tianjin University)*; Shogo Okada (Japan Advanced Institute of Science
  and Technology); Longbiao Wang (Tianjin University); Lili Guo (Tianjin
  University); Yaodong Song (Tianjin University); Jiaxing Liu (Tianjin
  University); Jianwu Dang (Tianjin University) | 
 
  |  |  |  | 
 
  |  |  |  | 
 
  | O35 | Special
  Session: Advances in Language, Vision, and Limited Supervision | 
 
  | Time |  | 
 
  | Chair | Yi
  Cai (South China University of Technology) | 
 
  | ID | Title | Author | 
 
  | 459 | MNRE:
  A Challenge Multimodal Dataset for Neural Relation Extraction with Visual
  Evidence in Social Media Posts | Changmeng
  Zheng (South China University of Technology); Zhiwei Wu (School of Software
  Engineering, South China University of Technology); Junhao Feng (South China
  University of Technology); Ze Fu (School of Software Engineering, South China
  University of Technology); Yi Cai (School of Software Engineering, South
  China University of Technology)* | 
 
  | 585 | MULTIMODAL
  FUSION NETWORK WITH LATENT TOPIC MEMORY FOR RUMOR DETECTION | jiaxin
  chen (Guangdong University of Technology)*; Zekai Wu (Guangdong University of
  Technology	); Zhenguo Yang (Guangdong University of Technology); Haoran Xie
  (Lingnan University); Fu Lee Wang (The Open University of Hong Kong); Wenyin
  Liu (Guangdong University of Technology) | 
 
  | 887 | DCNet:
  Dual-task Cycle Network for End-to-End Image Dehazing | Zhihua
  Chen (East China University of Science and Technology); Yu Zhou (East China
  University of Science and Technology); Ping Li (The Hong Kong Polytechnic
  University); Xiaoyu Chi (Goertek Co., Ltd); Lei Ma (Peking University); Bin
  Sheng (Shanghai Jiao Tong University)* | 
 
  | 1394 | Person
  Retrieval in Physical World | Wenxin
  Huang (Hubei University)*; Dongyang Li (Wuhan University); Ruimin Hu (Wuhan
  University); Chao Liang (Wuhan University); Xian Zhong (Wuhan University of
  Technology) | 
 
  | 43 | Image
  Captioning with Inherent Sentiment | tong
  li (Beijing Institute of Technology)*; yunhui hu (Beijing Institute of
  Technology); Xinxiao Wu (Beijing Institute of Technology) | 
 
  |  |  |  | 
 
  |  |  |  | 
 
  | O36 | Industry
  and Application Track II | 
 
  | Time |  | 
 
  | Chair | Lukas
  Esterle (Aarhus University) | 
 
  | ID | Title | Author | 
 
  | W124 | EXTENDED
  GUIDED IMAGE FILTERING FOR CONTRAST ENHANCEMENT | JIAFEI
  WU (SenseTime Research)*; Gengjie Li (SenseTime Research); Chong Wang (Ningbo
  University); Huakai Liu (SenseTime Research); shuai zhang (Sensetime Ltd);
  Guangcheng Zhang (SenseTime Research) | 
 
  | W129 | Fine-Grained
  Texture Identification for Reliable Product Traceability | Junsong
  Wang (Easy-Visible)*; Yubo Li (V-Origin Technology); ZhiYong Chang (V-Origin
  Technology); Haitao Yue,(V-Origin Technology); Yonghua Lin (V-Origin
  Technology) | 
 
  | W132 | A
  LIGHTWEIGHT APPROACH FOR WOOD HYPERSPECTRAL IMAGES CLASSIFICATION | Phyu
  Phyu Htun (University of Computer Studies, Yangon.)*; Marco Boschetti
  (Microtec srl GmbH); Attaullah Buriro (University of Bolzano); Roberto
  Confalonieri (Free University of Bozen-Bolzano); Boyuan Sun (Free University
  of Bolzano); Ah Nge Htwe (University of Computer Studies, Yangon.); Tammam
  Tillo (Indraprastha Institute of Information Technology Delhi) | 
 
  | W138 | Low
  Complexity Implementation of Intra String Copy in AVS3 | Yingbin
  Wang (Tencent); Xiaozhong Xu (Tencent America)*; Shan Liu (Tencent America) | 
 
 
  |  |  |  |