
Showing 1–50 of 89 results for author: Shu, X

  1. arXiv:2408.07483  [pdf, other]

    cs.HC

    Visualization Atlases: Explaining and Exploring Complex Topics through Data, Visualization, and Narration

    Authors: Jinrui Wang, Xinhuan Shu, Benjamin Bach, Uta Hinrichs

    Abstract: This paper defines, analyzes, and discusses the emerging genre of visualization atlases. We currently witness an increase in web-based, data-driven initiatives that call themselves "atlases" while explaining complex, contemporary issues through data and visualizations: climate change, sustainability, AI, or cultural discoveries. To understand this emerging genre and inform their design, study, and…

    Submitted 14 August, 2024; originally announced August 2024.

  2. arXiv:2408.01272  [pdf, other]

    cs.HC

    Does This Have a Particular Meaning? Interactive Pattern Explanation for Network Visualizations

    Authors: Xinhuan Shu, Alexis Pister, Junxiu Tang, Fanny Chevalier, Benjamin Bach

    Abstract: This paper presents an interactive technique to explain visual patterns in network visualizations to analysts who do not understand these visualizations and who are learning to read them. Learning a visualization requires mastering its visual grammar and decoding information presented through visual marks, graphical encodings, and spatial configurations. To help people learn network visualization…

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: to be published in IEEE VIS 2024

  3. arXiv:2407.19727  [pdf, other]

    cs.IR

    Adaptive Utilization of Cross-scenario Information for Multi-scenario Recommendation

    Authors: Xiufeng Shu, Ruidong Han, Xiang Li, Wei Lin

    Abstract: Recommender system of the e-commerce platform usually serves multiple business scenarios. Multi-scenario Recommendation (MSR) is an important topic that improves ranking performance by leveraging information from different scenarios. Recent methods for MSR mostly construct scenario shared or specific modules to model commonalities and differences among scenarios. However, when the amount of data a…

    Submitted 29 July, 2024; originally announced July 2024.

  4. arXiv:2405.17188  [pdf, other]

    cs.CV

    The SkatingVerse Workshop & Challenge: Methods and Results

    Authors: Jian Zhao, Lei Jin, Jianshu Li, Zheng Zhu, Yinglei Teng, Jiaojiao Zhao, Sadaf Gulshad, Zheng Wang, Bo Zhao, Xiangbo Shu, Yunchao Wei, Xuecheng Nie, Xiaojie Jin, Xiaodan Liang, Shin'ichi Satoh, Yandong Guo, Cewu Lu, Junliang Xing, Jane Shen Shengmei

    Abstract: The SkatingVerse Workshop & Challenge aims to encourage research in developing novel and accurate methods for human action understanding. The SkatingVerse dataset used for the SkatingVerse Challenge has been publicly released. There are two subsets in the dataset, i.e., the training subset and testing subset. The training subsets consists of 19,993 RGB video sequences, and the testing subsets cons…

    Submitted 27 May, 2024; originally announced May 2024.

  5. arXiv:2405.02538  [pdf, other]

    cs.CV

    AdaFPP: Adapt-Focused Bi-Propagating Prototype Learning for Panoramic Activity Recognition

    Authors: Meiqi Cao, Rui Yan, Xiangbo Shu, Guangzhao Dai, Yazhou Yao, Guo-Sen Xie

    Abstract: Panoramic Activity Recognition (PAR) aims to identify multi-granularity behaviors performed by multiple persons in panoramic scenes, including individual activities, group activities, and global activities. Previous methods 1) heavily rely on manually annotated detection boxes in training and inference, hindering further practical deployment; or 2) directly employ normal detectors to detect multip…

    Submitted 3 May, 2024; originally announced May 2024.

  6. arXiv:2405.02077  [pdf, other]

    cs.CV

    MVP-Shot: Multi-Velocity Progressive-Alignment Framework for Few-Shot Action Recognition

    Authors: Hongyu Qu, Rui Yan, Xiangbo Shu, Hailiang Gao, Peng Huang, Guo-Sen Xie

    Abstract: Recent few-shot action recognition (FSAR) methods typically perform semantic matching on learned discriminative features to achieve promising performance. However, most FSAR methods focus on single-scale (e.g., frame-level, segment-level, etc) feature alignment, which ignores that human actions with the same semantic may appear at different velocities. To this end, we develop a novel Multi-Velocit…

    Submitted 23 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  7. arXiv:2401.10039  [pdf, other]

    cs.CV

    GPT4Ego: Unleashing the Potential of Pre-trained Models for Zero-Shot Egocentric Action Recognition

    Authors: Guangzhao Dai, Xiangbo Shu, Wenhao Wu, Rui Yan, Jiachao Zhang

    Abstract: Vision-Language Models (VLMs), pre-trained on large-scale datasets, have shown impressive performance in various visual recognition tasks. This advancement paves the way for notable performance in Zero-Shot Egocentric Action Recognition (ZS-EAR). Typically, VLMs handle ZS-EAR as a global video-text matching task, which often leads to suboptimal alignment of vision and linguistic knowledge. We prop…

    Submitted 11 May, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  8. arXiv:2312.11225  [pdf, other]

    cs.CR

    MAD-MulW: A Multi-Window Anomaly Detection Framework for BGP Security Events

    Authors: Songtao Peng, Yiping Chen, Xincheng Shu, Wu Shuai, Shenhao Fang, Zhongyuan Ruan, Qi Xuan

    Abstract: In recent years, various international security events have occurred frequently and interacted between real society and cyberspace. Traditional traffic monitoring mainly focuses on the local anomalous status of events due to a large amount of data. BGP-based event monitoring makes it possible to perform differential analysis of international events. For many existing traffic anomaly detection meth…

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 10 pages, 8 figures

  9. arXiv:2309.06902  [pdf, other]

    cs.CV

    CCSPNet-Joint: Efficient Joint Training Method for Traffic Sign Detection Under Extreme Conditions

    Authors: Haoqin Hong, Yue Zhou, Xiangyu Shu, Xiaofang Hu

    Abstract: Traffic sign detection is an important research direction in intelligent driving. Unfortunately, existing methods often overlook extreme conditions such as fog, rain, and motion blur. Moreover, the end-to-end training strategy for image denoising and object detection models fails to utilize inter-model information effectively. To address these issues, we propose CCSPNet, an efficient feature extra…

    Submitted 3 February, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

  10. arXiv:2308.15795  [pdf, other]

    cs.CV

    Occlusion-Aware Detection and Re-ID Calibrated Network for Multi-Object Tracking

    Authors: Yukun Su, Ruizhou Sun, Xin Shu, Yu Zhang, Qingyao Wu

    Abstract: Multi-Object Tracking (MOT) is a crucial computer vision task that aims to predict the bounding boxes and identities of objects simultaneously. While state-of-the-art methods have made remarkable progress by jointly optimizing the multi-task problems of detection and Re-ID feature learning, yet, few approaches explore to tackle the occlusion issue, which is a long-standing challenge in the MOT fie…

    Submitted 30 August, 2023; originally announced August 2023.

  11. arXiv:2308.14105  [pdf, other]

    cs.CV cs.AI

    Unified and Dynamic Graph for Temporal Character Grouping in Long Videos

    Authors: Xiujun Shu, Wei Wen, Liangsheng Xu, Ruizhi Qiao, Taian Guo, Hanjun Li, Bei Gan, Xiao Wang, Xing Sun

    Abstract: Video temporal character grouping locates appearing moments of major characters within a video according to their identities. To this end, recent works have evolved from unsupervised clustering to graph-based supervised clustering. However, graph methods are built upon the premise of fixed affinity graphs, bringing many inexact connections. Besides, they extract multi-modal features with kinds of…

    Submitted 22 June, 2024; v1 submitted 27 August, 2023; originally announced August 2023.

  12. arXiv:2308.04197  [pdf, other]

    cs.CV

    D3G: Exploring Gaussian Prior for Temporal Sentence Grounding with Glance Annotation

    Authors: Hanjun Li, Xiujun Shu, Sunan He, Ruizhi Qiao, Wei Wen, Taian Guo, Bei Gan, Xing Sun

    Abstract: Temporal sentence grounding (TSG) aims to locate a specific moment from an untrimmed video with a given natural language query. Recently, weakly supervised methods still have a large performance gap compared to fully supervised ones, while the latter requires laborious timestamp annotations. In this study, we aim to reduce the annotation cost yet keep competitive performance for TSG task compared…

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: ICCV2023

  13. arXiv:2308.04040  [pdf, other]

    cs.HC

    WonderFlow: Narration-Centric Design of Animated Data Videos

    Authors: Yun Wang, Leixian Shen, Zhengxin You, Xinhuan Shu, Bongshin Lee, John Thompson, Haidong Zhang, Dongmei Zhang

    Abstract: Creating an animated data video enriched with audio narration takes a significant amount of time and effort and requires expertise. Users not only need to design complex animations, but also turn written text scripts into audio narrations and synchronize visual changes with the narrations. This paper presents WonderFlow, an interactive authoring tool, that facilitates narration-centric design of a…

    Submitted 6 June, 2024; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: Accepted by TVCG

  14. arXiv:2307.11957  [pdf, other]

    physics.optics cs.CV cs.ET cs.LG

    High-performance real-world optical computing trained by in situ model-free optimization

    Authors: Guangyuan Zhao, Xin Shu, Renjie Zhou

    Abstract: Optical computing systems provide high-speed and low-energy data processing but face deficiencies in computationally demanding training and simulation-to-reality gaps. We propose a gradient-based model-free optimization (G-MFO) method based on a Monte Carlo gradient estimation algorithm for computationally efficient in situ training of optical computing systems. This approach treats an optical com…

    Submitted 2 April, 2024; v1 submitted 21 July, 2023; originally announced July 2023.

  15. arXiv:2307.04139  [pdf, ps, other]

    cs.DS

    A Randomized Algorithm for Single-Source Shortest Path on Undirected Real-Weighted Graphs

    Authors: Ran Duan, Jiayi Mao, Xinkai Shu, Longhui Yin

    Abstract: In undirected graphs with real non-negative weights, we give a new randomized algorithm for the single-source shortest path (SSSP) problem with running time $O(m\sqrt{\log n \cdot \log\log n})$ in the comparison-addition model. This is the first algorithm to break the $O(m+n\log n)$ time bound for real-weighted sparse graphs by Dijkstra's algorithm with Fibonacci heaps. Previous undirected non-neg…

    Submitted 4 October, 2023; v1 submitted 9 July, 2023; originally announced July 2023.

    Comments: 17 pages

    MSC Class: 68W20 ACM Class: F.2.2

  16. Creating Emordle: Animating Word Cloud for Emotion Expression

    Authors: Liwenhan Xie, Xinhuan Shu, Jeon Cheol Su, Yun Wang, Siming Chen, Huamin Qu

    Abstract: We propose emordle, a conceptual design that animates wordles (compact word clouds) to deliver their emotional context to the audiences. To inform the design, we first reviewed online examples of animated texts and animated wordles, and summarized strategies for injecting emotion into the animations. We introduced a composite approach that extends an existing animation scheme for one word to multi…

    Submitted 14 June, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: Accepted in IEEE Transactions on Visualization and Computer Graphics

  17. arXiv:2304.04023  [pdf, other]

    cs.CV

    Attack is Good Augmentation: Towards Skeleton-Contrastive Representation Learning

    Authors: Binqian Xu, Xiangbo Shu, Rui Yan, Guo-Sen Xie, Yixiao Ge, Mike Zheng Shou

    Abstract: Contrastive learning, relying on effective positive and negative sample pairs, is beneficial to learn informative skeleton representations in unsupervised skeleton-based action recognition. To achieve these positive and negative pairs, existing weak/strong data augmentation methods have to randomly change the appearance of skeletons for indirectly pursuing semantic perturbations. However, such app…

    Submitted 8 April, 2023; originally announced April 2023.

  18. arXiv:2304.01647  [pdf, other]

    cs.CV cs.AI

    SC-ML: Self-supervised Counterfactual Metric Learning for Debiased Visual Question Answering

    Authors: Xinyao Shu, Shiyang Yan, Xu Yang, Ziheng Wu, Zhongfeng Chen, Zhenyu Lu

    Abstract: Visual question answering (VQA) is a critical multimodal task in which an agent must answer questions according to the visual cue. Unfortunately, language bias is a common problem in VQA, which refers to the model generating answers only by associating with the questions while ignoring the visual content, resulting in biased results. We tackle the language bias problem by proposing a self-supervis…

    Submitted 4 April, 2023; originally announced April 2023.

  19. arXiv:2303.14768  [pdf, other]

    cs.CV

    Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies

    Authors: Bei Gan, Xiujun Shu, Ruizhi Qiao, Haoqian Wu, Keyu Chen, Hanjun Li, Bo Ren

    Abstract: Movie highlights stand out of the screenplay for efficient browsing and play a crucial role on social media platforms. Based on existing efforts, this work has two observations: (1) For different annotators, labeling highlight has uncertainty, which leads to inaccurate and time-consuming annotations. (2) Besides previous supervised or unsupervised settings, some existing video corpora can be usefu…

    Submitted 26 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR2023

  20. Work with AI and Work for AI: Autonomous Vehicle Safety Drivers' Lived Experiences

    Authors: Mengdi Chu, Keyu Zong, Xin Shu, Jiangtao Gong, Zicong Lu, Kaimin Guo, Xinyi Dai, Guyue Zhou

    Abstract: The development of Autonomous Vehicle (AV) has created a novel job, the safety driver, recruited from experienced drivers to supervise and operate AV in numerous driving missions. Safety drivers usually work with non-perfect AV in high-risk real-world traffic environments for road testing tasks. However, this group of workers is under-explored in the HCI community. To fill this gap, we conducted s…

    Submitted 8 March, 2023; originally announced March 2023.

    Comments: 17 pages, 2 figures

    MSC Class: J.4

    Journal ref: CHI 2023

  21. arXiv:2302.02327  [pdf, other]

    cs.CV

    Pyramid Self-attention Polymerization Learning for Semi-supervised Skeleton-based Action Recognition

    Authors: Binqian Xu, Xiangbo Shu

    Abstract: Most semi-supervised skeleton-based action recognition approaches aim to learn the skeleton action representations only at the joint level, but neglect the crucial motion characteristics at the coarser-grained body (e.g., limb, trunk) level that provide rich additional semantic information, though the number of labeled data is limited. In this work, we propose a novel Pyramid Self-attention Polyme…

    Submitted 5 February, 2023; originally announced February 2023.

  22. arXiv:2302.02316  [pdf, other]

    cs.CV

    Spatiotemporal Decouple-and-Squeeze Contrastive Learning for Semi-Supervised Skeleton-based Action Recognition

    Authors: Binqian Xu, Xiangbo Shu

    Abstract: Contrastive learning has been successfully leveraged to learn action representations for addressing the problem of semi-supervised skeleton-based action recognition. However, most contrastive learning-based methods only contrast global features mixing spatiotemporal information, which confuses the spatial- and temporal-specific information reflecting different semantic at the frame level and joint…

    Submitted 5 February, 2023; originally announced February 2023.

  23. arXiv:2301.09498  [pdf, other]

    cs.CV

    Triplet Contrastive Representation Learning for Unsupervised Vehicle Re-identification

    Authors: Fei Shen, Xiaoyu Du, Liyan Zhang, Xiangbo Shu, Jinhui Tang

    Abstract: Part feature learning is critical for fine-grained semantic understanding in vehicle re-identification. However, existing approaches directly model part features and global features, which can easily lead to serious gradient vanishing issues due to their unequal feature information and unreliable pseudo-labels for unsupervised vehicle re-identification. To address this problem, in this paper, we p…

    Submitted 15 March, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

  24. arXiv:2301.06309  [pdf, other]

    cs.CV

    UATVR: Uncertainty-Adaptive Text-Video Retrieval

    Authors: Bo Fang, Wenhao Wu, Chang Liu, Yu Zhou, Yuxin Song, Weiping Wang, Xiangbo Shu, Xiangyang Ji, Jingdong Wang

    Abstract: With the explosive growth of web videos and emerging large-scale vision-language pre-training models, e.g., CLIP, retrieving videos of interest with text instructions has attracted increasing attention. A common practice is to transfer text-video pairs to the same embedding space and craft cross-modal interactions with certain entities in specific granularities for semantic correspondence. Unfortu…

    Submitted 18 August, 2023; v1 submitted 16 January, 2023; originally announced January 2023.

    Comments: To appear at ICCV2023

  25. arXiv:2212.06348  [pdf, other]

    cs.CV

    Dilation-Erosion for Single-Frame Supervised Temporal Action Localization

    Authors: Bin Wang, Yan Song, Fanming Wang, Yang Zhao, Xiangbo Shu, Yan Rui

    Abstract: To balance the annotation labor and the granularity of supervision, single-frame annotation has been introduced in temporal action localization. It provides a rough temporal location for an action but implicitly overstates the supervision from the annotated-frame during training, leading to the confusion between actions and backgrounds, i.e., action incompleteness and background false positives. T…

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: 28 pages, 8 figures

  26. arXiv:2211.14502  [pdf, other]

    cs.CV

    Learning Single Image Defocus Deblurring with Misaligned Training Pairs

    Authors: Yu Li, Dongwei Ren, Xinya Shu, Wangmeng Zuo

    Abstract: By adopting popular pixel-wise loss, existing methods for defocus deblurring heavily rely on well aligned training image pairs. Although training pairs of ground-truth and blurry images are carefully collected, e.g., DPDD dataset, misalignment is inevitable between training pairs, making existing methods possibly suffer from deformation artifacts. In this paper, we propose a joint deblurring and r…

    Submitted 29 November, 2022; v1 submitted 26 November, 2022; originally announced November 2022.

    Comments: https://github.com/liyucs/JDRL

  27. arXiv:2211.08894  [pdf, other]

    cs.CV

    AdaTriplet-RA: Domain Matching via Adaptive Triplet and Reinforced Attention for Unsupervised Domain Adaptation

    Authors: Xinyao Shu, Shiyang Yan, Zhenyu Lu, Xinshao Wang, Yuan Xie

    Abstract: Unsupervised domain adaption (UDA) is a transfer learning task where the data and annotations of the source domain are available but only have access to the unlabeled target data during training. Most previous methods try to minimise the domain gap by performing distribution alignment between the source and target domains, which has a notable limitation, i.e., operating at the domain level, but ne…

    Submitted 16 November, 2022; originally announced November 2022.

  28. arXiv:2211.03077  [pdf, ps, other]

    cs.DS

    Online Nash Welfare Maximization Without Predictions

    Authors: Zhiyi Huang, Minming Li, Xinkai Shu, Tianze Wei

    Abstract: The maximization of Nash welfare, which equals the geometric mean of agents' utilities, is widely studied because it balances efficiency and fairness in resource allocation problems. Banerjee, Gkatzelis, Gorokh, and Jin (2022) recently introduced the model of online Nash welfare maximization for $T$ divisible items and $N$ agents with additive utilities with predictions of each agent's utility for…

    Submitted 10 February, 2023; v1 submitted 6 November, 2022; originally announced November 2022.

  29. arXiv:2210.14231  [pdf]

    eess.IV cs.LG

    NAS-PRNet: Neural Architecture Search generated Phase Retrieval Net for Off-axis Quantitative Phase Imaging

    Authors: Xin Shu, Mengxuan Niu, Yi Zhang, Renjie Zhou

    Abstract: Single neural networks have achieved simultaneous phase retrieval with aberration compensation and phase unwrapping in off-axis Quantitative Phase Imaging (QPI). However, when designing the phase retrieval neural network architecture, the trade-off between computation latency and accuracy has been largely neglected. Here, we propose Neural Architecture Search (NAS) generated Phase Retrieval Net (N…

    Submitted 25 October, 2022; originally announced October 2022.

  30. arXiv:2209.05739  [pdf, other]

    cs.HC

    MetaGlyph: Automatic Generation of Metaphoric Glyph-based Visualization

    Authors: Lu Ying, Xinhuan Shu, Dazhen Deng, Yuchen Yang, Tan Tang, Lingyun Yu, Yingcai Wu

    Abstract: Glyph-based visualization achieves an impressive graphic design when associated with comprehensive visual metaphors, which help audiences effectively grasp the conveyed information through revealing data semantics. However, creating such metaphoric glyph-based visualization (MGV) is not an easy task, as it requires not only a deep understanding of data but also professional design skills. This pap…

    Submitted 13 September, 2022; originally announced September 2022.

  31. arXiv:2208.09374  [pdf, other]

    cs.CV

    VLMAE: Vision-Language Masked Autoencoder

    Authors: Sunan He, Taian Guo, Tao Dai, Ruizhi Qiao, Chen Wu, Xiujun Shu, Bo Ren

    Abstract: Image and language modeling is of crucial importance for vision-language pre-training (VLP), which aims to learn multi-modal representations from large-scale paired image-text data. However, we observe that most existing VLP methods focus on modeling the interactions between image and text features while neglecting the information disparity between image and text, thus suffering from focal bias. T…

    Submitted 19 August, 2022; originally announced August 2022.

    Comments: 12 pages, 7 figures

  32. arXiv:2208.08608  [pdf, other]

    cs.CV

    See Finer, See More: Implicit Modality Alignment for Text-based Person Retrieval

    Authors: Xiujun Shu, Wei Wen, Haoqian Wu, Keyu Chen, Yiran Song, Ruizhi Qiao, Bo Ren, Xiao Wang

    Abstract: Text-based person retrieval aims to find the query person based on a textual description. The key is to learn a common latent space mapping between visual-textual modalities. To achieve this goal, existing works employ segmentation to obtain explicitly cross-modal alignments or utilize attention to explore salient alignments. These methods have two shortcomings: 1) Labeling cross-modal alignments…

    Submitted 25 August, 2022; v1 submitted 17 August, 2022; originally announced August 2022.

    Comments: Accepted at ECCV Workshop on Real-World Surveillance (RWS 2022)

  33. arXiv:2208.06179  [pdf, other]

    cs.CV cs.AI

    Exploiting Feature Diversity for Make-up Temporal Video Grounding

    Authors: Xiujun Shu, Wei Wen, Taian Guo, Sunan He, Chen Wu, Ruizhi Qiao

    Abstract: This technical report presents the 3rd winning solution for MTVG, a new task introduced in the 4-th Person in Context (PIC) Challenge at ACM MM 2022. MTVG aims at localizing the temporal boundary of the step in an untrimmed video based on a textual description. The biggest challenge of this task is the fine-grained video-text semantics of make-up steps. However, current methods mainly extract vid…

    Submitted 12 August, 2022; originally announced August 2022.

    Comments: 3rd Place in PIC Makeup Temporal Video Grounding (MTVG) Challenge in ACM-MM 2022

  34. arXiv:2203.02883  [pdf, ps, other]

    cs.DS cs.GT

    The Power of Multiple Choices in Online Stochastic Matching

    Authors: Zhiyi Huang, Xinkai Shu, Shuyi Yan

    Abstract: We study the power of multiple choices in online stochastic matching. Despite a long line of research, existing algorithms still only consider two choices of offline neighbors for each online vertex because of the technical challenge in analyzing multiple choices. This paper introduces two approaches for designing and analyzing algorithms that use multiple choices. For unweighted and vertex-weight…

    Submitted 6 March, 2022; originally announced March 2022.

    Comments: STOC 2022

  35. arXiv:2112.12793  [pdf, other]

    cs.LG cs.AI cs.NI

    A Multi-View Framework for BGP Anomaly Detection via Graph Attention Network

    Authors: Songtao Peng, Jiaqi Nie, Xincheng Shu, Zhongyuan Ruan, Lei Wang, Yunxuan Sheng, Qi Xuan

    Abstract: As the default protocol for exchanging routing reachability information on the Internet, the abnormal behavior in traffic of Border Gateway Protocols (BGP) is closely related to Internet anomaly events. The BGP anomalous detection model ensures stable routing services on the Internet through its real-time monitoring and alerting capabilities. Previous studies either focused on the feature selectio…

    Submitted 23 December, 2021; originally announced December 2021.

    Comments: 12 pages, 8 figures

  36. arXiv:2112.10992  [pdf, other]

    cs.CV stat.ML

    Expansion-Squeeze-Excitation Fusion Network for Elderly Activity Recognition

    Authors: Xiangbo Shu, Jiawen Yang, Rui Yan, Yan Song

    Abstract: This work focuses on the task of elderly activity recognition, which is a challenging task due to the existence of individual actions and human-object interactions in elderly activities. Thus, we attempt to effectively aggregate the discriminative information of actions and interactions from both RGB videos and skeleton sequences by attentively fusing multi-modal features. Recently, some nonlinear…

    Submitted 24 April, 2022; v1 submitted 21 December, 2021; originally announced December 2021.

  37. arXiv:2111.13888  [pdf, other]

    cs.CV

    Head and Body: Unified Detector and Graph Network for Person Search in Media

    Authors: Xiujun Shu, Yusheng Tao, Ruizhi Qiao, Bo Ke, Wei Wen, Bo Ren

    Abstract: Person search in media has seen increasing potential in Internet applications, such as video clipping and character collection. This task is common but overlooked by previous person search works which focus on surveillance scenes. The media scenarios have some different challenges from surveillance scenes. For example, a person may change his clothes frequently. To alleviate this issue, this paper…

    Submitted 27 November, 2021; originally announced November 2021.

  38. Learning to Disentangle Scenes for Person Re-identification

    Authors: Xianghao Zang, Ge Li, Wei Gao, Xiujun Shu

    Abstract: There are many challenging problems in the person re-identification (ReID) task, such as the occlusion and scale variation. Existing works usually tried to solve them by employing a one-branch network. This one-branch network needs to be robust to various challenging problems, which makes this network overburdened. This paper proposes to divide-and-conquer the ReID task. For this purpose, we emplo…

    Submitted 28 February, 2022; v1 submitted 9 November, 2021; originally announced November 2021.

    Comments: Preprint Version; Accepted by Image and Vision Computing

    Journal ref: Image and Vision Computing 2021

  39. Exploiting Robust Unsupervised Video Person Re-identification

    Authors: Xianghao Zang, Ge Li, Wei Gao, Xiujun Shu

    Abstract: Unsupervised video person re-identification (reID) methods usually depend on global-level features. And many supervised reID methods employed local-level features and achieved significant performance improvements. However, applying local-level features to unsupervised methods may introduce an unstable performance. To improve the performance stability for unsupervised video reID, this paper introdu…

    Submitted 12 February, 2022; v1 submitted 9 November, 2021; originally announced November 2021.

    Comments: Preprint version; Accepted by IET Image Processing

    Journal ref: IET Image Processing 2022

  40. arXiv:2110.01848  [pdf, other]

    cs.IT cs.AI cs.LG

    Cellular Network Radio Propagation Modeling with Deep Convolutional Neural Networks

    Authors: Xin Zhang, Xiujun Shu, Bingwen Zhang, Jie Ren, Lizhou Zhou, Xin Chen

    Abstract: Radio propagation modeling and prediction is fundamental for modern cellular network planning and optimization. Conventional radio propagation models fall into two categories. Empirical models, based on coarse statistics, are simple and computationally efficient, but are inaccurate due to oversimplification. Deterministic models, such as ray tracing based on physical laws of wave propagation, are…

    Submitted 5 October, 2021; originally announced October 2021.

    Journal ref: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August, 2020, Pages 2378

  41. Data Acquisition and Preparation for Dual-reference Deep Learning of Image Super-Resolution

    Authors: Yanhui Guo, Xiaolin Wu, Xiao Shu

    Abstract: The performance of deep learning based image super-resolution (SR) methods depend on how accurately the paired low and high resolution images for training characterize the sampling process of real cameras. Low and high resolution (LR$\sim$HR) image pairs synthesized by degradation models (e.g., bicubic downsampling) deviate from those in reality; thus the synthetically-trained DCNN SR models work…

    Submitted 19 June, 2022; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: Accepted by IEEE Transactions on Image Processing (TIP)

  42. arXiv:2107.13504  [pdf, other]

    cs.NI

    Inferring Multiple Relationships between ASes using Graph Convolutional Network

    Authors: Songtao Peng, Xincheng Shu, Zhongyuan Ruan, Zegang Huang, Qi Xuan

    Abstract: Precisely understanding the business relationships between Autonomous Systems (ASes) is essential for studying the Internet structure. So far, many inference algorithms have been proposed to classify the AS relationships, which mainly focus on Peer-Peer (P2P) and Provider-Customer (P2C) binary classification and achieved excellent results. However, there are other types of AS relationships in actu…

    Submitted 28 July, 2021; originally announced July 2021.

    Comments: 12 pages, 10 figures

  43. Semantic-guided Pixel Sampling for Cloth-Changing Person Re-identification

    Authors: Xiujun Shu, Ge Li, Xiao Wang, Weijian Ruan, Qi Tian

    Abstract: Cloth-changing person re-identification (re-ID) is a new rising research topic that aims at retrieving pedestrians whose clothes are changed. This task is quite challenging and has not been fully studied to date. Current works mainly focus on body shape or contour sketch, but they are not robust enough due to view and posture variations. The key to this task is to exploit cloth-irrelevant cues. Th…

    Submitted 23 July, 2021; originally announced July 2021.

    Comments: This paper has been published in IEEE Signal Processing Letters

  44. arXiv:2107.10433  [pdf, other]

    cs.CV cs.AI

    MFGNet: Dynamic Modality-Aware Filter Generation for RGB-T Tracking

    Authors: Xiao Wang, Xiujun Shu, Shiliang Zhang, Bo Jiang, Yaowei Wang, Yonghong Tian, Feng Wu

    Abstract: Many RGB-T trackers attempt to attain robust feature representation by utilizing an adaptive weighting scheme (or attention mechanism). Different from these works, we propose a new dynamic modality-aware filter generation module (named MFGNet) to boost the message communication between visible and thermal data by adaptively adjusting the convolutional kernels for various input images in practical…

    Submitted 9 May, 2022; v1 submitted 21 July, 2021; originally announced July 2021.

    Comments: Accepted by IEEE TMM 2022

  45. arXiv:2107.09298  [pdf, other]

    cs.SD eess.AS

    Joint Echo Cancellation and Noise Suppression based on Cascaded Magnitude and Complex Mask Estimation

    Authors: Xiaofeng Shu, Yehang Zhu, Yanjie Chen, Li Chen, Haohe Liu, Chuanzeng Huang, Yuxuan Wang

    Abstract: Acoustic echo and background noise can seriously degrade the intelligibility of speech. In practice, echo and noise suppression are usually treated as two separate tasks and can be removed with various digital signal processing (DSP) and deep learning techniques. In this paper, we propose a new cascaded model, magnitude and complex temporal convolutional neural network (MC-TCN), to jointly perfor…

    Submitted 20 July, 2021; originally announced July 2021.

  46. arXiv:2106.10493  [pdf, other]

    cs.CV

    CenterAtt: Fast 2-stage Center Attention Network

    Authors: Jianyun Xu, Xin Tang, Jian Dou, Xu Shu, Yushi Zhu

    Abstract: In this technical report, we introduce the methods of HIKVISION_LiDAR_Det in the challenge of waymo open dataset real-time 3D detection. Our solution for the competition is built upon the CenterPoint 3D detection framework. Several variants of CenterPoint are explored, including center attention head and feature pyramid network neck. In order to achieve real time detection, methods like batchnorm mer…

    Submitted 19 June, 2021; originally announced June 2021.

  47. arXiv:2105.15076  [pdf, other]

    cs.CV

    Large-Scale Spatio-Temporal Person Re-identification: Algorithms and Benchmark

    Authors: Xiujun Shu, Xiao Wang, Xianghao Zang, Shiliang Zhang, Yuanqi Chen, Ge Li, Qi Tian

    Abstract: Person re-identification (re-ID) in the scenario with large spatial and temporal spans has not been fully explored. This is partly because existing benchmark datasets were mainly collected with limited spatial and temporal ranges, e.g., using videos recorded in a few days by cameras in a specific region of the campus. Such limited spatial and temporal ranges make it hard to simulate the d…

    Submitted 27 November, 2021; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)

  48. arXiv:2104.10319  [pdf, other]

    cs.CR cs.AI

    Evidential Cyber Threat Hunting

    Authors: Frederico Araujo, Dhilung Kirat, Xiaokui Shu, Teryl Taylor, Jiyong Jang

    Abstract: A formal cyber reasoning framework for automating the threat hunting process is described. The new cyber reasoning methodology introduces an operational semantics that operates over three subspaces -- knowledge, hypothesis, and action -- to enable human-machine co-creation of threat hypotheses and protective recommendations. An implementation of this framework shows that the approach is practical…

    Submitted 20 April, 2021; originally announced April 2021.

    Comments: 5 pages, SDM AI4CS 2021

    Journal ref: In Proceedings of the 2021 SIAM AI/ML for Cybersecurity Workshop (AI4CS)

  49. arXiv:2103.16989  [pdf, other]

    cs.SI

    Bidirectional group random walk based network embedding for asymmetric proximity

    Authors: Jiawei Shen, Xincheng Shu, Hu Yang

    Abstract: Network embedding aims to represent a network into a low dimensional space where the network structural information and inherent properties are maximally preserved. Random walk based network embedding methods such as DeepWalk and node2vec have shown outstanding performance in the aspect of preserving the network topological structure. However, these approaches either predict the distribution of a…

    Submitted 31 March, 2021; originally announced March 2021.

    Comments: 18 pages, 7 figures

  50. arXiv:2103.16746  [pdf, other]

    cs.CV cs.AI

    Towards More Flexible and Accurate Object Tracking with Natural Language: Algorithms and Benchmark

    Authors: Xiao Wang, Xiujun Shu, Zhipeng Zhang, Bo Jiang, Yaowei Wang, Yonghong Tian, Feng Wu

    Abstract: Tracking by natural language specification is a new rising research topic that aims at locating the target object in the video sequence based on its language description. Compared with traditional bounding box (BBox) based tracking, this setting guides object tracking with high-level semantic information, addresses the ambiguity of BBox, and links local and global search organically together. Thos…

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: Accepted by CVPR 2021