Welcome to the Video, Image, and Sound Analysis Lab (VISAL) at the City University of Hong Kong! The lab is directed by Prof. Antoni Chan in the Department of Computer Science.

Our main research activities include:

  • Computer Vision, Surveillance
  • Machine Learning, Pattern Recognition
  • Computer Audition, Music Information Retrieval
  • Eye Gaze Analysis

For more information about our current research, please visit the projects and publication pages.

Opportunities for graduate students and research assistants – if you are interested in joining the lab, please check this information.

Latest News [more]

  • [Aug 16, 2021]

    Congratulations Qingzhong for defending his thesis!

  • [Aug 12, 2021]

    Congratulations to Jia for defending his thesis!

  • [Jul 1, 2021]

    Dr. Chan was promoted to Professor!

  • [Apr 29, 2021]

    Congratulations to Yufei for defending his thesis!

Recent Publications [more]

  • Dynamic Momentum Adaptation for Zero-Shot Cross-Domain Crowd Counting.
    Qiangqiang Wu, Jia Wan, and Antoni B. Chan,
    In: ACM Multimedia (MM), to appear Oct 2021.
  • Group-based Distinctive Image Captioning with Memory Attention.
    Jiuniu Wang, Wenjia Xu, Qingzhong Wang, and Antoni B. Chan,
    In: ACM Multimedia (MM), to appear Oct 2021 (oral).
  • A Comparative Survey: Benchmarking for Pool-based Active Learning.
    Xueying Zhan, Huan Liu, Qing Li, and Antoni B. Chan,
    In: International Joint Conf. on Artificial Intelligence (IJCAI), Survey Track, to appear Aug 2021.
  • Clustering Hidden Markov Models With Variational Bayesian Hierarchical EM.
    Hui Lan, Ziquan Liu, Janet H. Hsiao, Dan Yu, and Antoni B. Chan,
    IEEE Trans. on Neural Networks and Learning Systems (TNNLS), To appear 2021.
  • The effects of attentional and interpretation biases on later pain outcomes among younger and older adults: A prospective study.
    Frederick H.F. Chan, Hin Suen, Antoni B. Chan, Janet H. Hsiao, and Tom J. Barry,
    European Journal of Pain, [online] Aug 2021.
  • Hierarchical Learning of Hidden Markov Models with Clustering Regularization.
    Hui Lan and Antoni B. Chan,
    In: 37th Conference on Uncertainty in Artificial Intelligence (UAI), Jul 2021.
  • Multiple-criteria Based Active Learning with Fixed-size Determinantal Point Processes.
    Xueying Zhan, Qing Li, and Antoni B. Chan,
    In: Subset Selection in Machine Learning: From Theory to Applications, ICML Workshop, July 2021.
  • Improve Generalization and Robustness of Neural Networks via Weight Scale Shifting Invariant Regularizations.
    Ziquan Liu, Yufei Cui, and Antoni B. Chan,
    In: ICML workshop on Adversarial Machine Learning, July 2021.
  • Meta-Graph Adaptation for Visual Object Tracking.
    Qiangqiang Wu and Antoni B. Chan,
    In: IEEE International Conference on Multimedia and Expo (ICME), Jul 2021 (oral).
  • A Generalized Loss Function for Crowd Counting and Localization.
    Jia Wan, Ziquan Liu, and Antoni B. Chan,
    In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2021. [supplemental]

Recent Project Pages [more]

Eye Movement analysis with Hidden Markov Models (EMHMM) with co-clustering

We analyze eye movement data on stimuli with different feature layouts. Through co-clustering HMMs, we discover common strategies on each stimuli and cluster subjects with similar strategies.

Meta-Graph Adaptation for Visual Object Tracking

In this paper, we propose a novel meta-graph adaptation network (MGA-Net) to effectively adapt backbone feature extractors in existing deep trackers to a specific online tracking task.

Progressive Unsupervised Learning for Visual Object Tracking

In this paper, we propose a progressive unsupervised learning (PUL) framework, which entirely removes the need for annotated training videos in visual tracking.

Fully Nested Neural Network for Adaptive Compression and Quantization

We propose a fully nested neural network (FN3) that runs only once to build a nested set of compressed/quantized models, which is optimal for different resource constraints. We then propose a Bayesian version that estimates the ordered dropout hyperparameter and has well calibrated uncertainty.

A Generalized Loss Function for Crowd Counting and Localization

We propose a generalized loss function for density map regression based on unbalanced optimal transport. We prove that pixel-wise L2 loss and Bayesian loss are special cases and sub-optimal solutions to our proposed loss. Since the predicted density will be pushed toward annotation positions, the density map prediction will be sparse and can naturally be used for localization.

Recent Datasets and Code [more]

CVCS: Cross-View Cross-Scene Multi-View Crowd Counting Dataset

Synthetic dataset for cross-view cross-scene multi-view counting. The dataset contains 31 scenes, each with about ~100 camera views. For each scene, we capture 100 multi-view images of crowds.

Fine-Grained Crowd Counting Dataset

Dataset for fine-grained crowd counting, which differentiates a crowd into categories based on the low-level behavior attributes of the individuals (e.g. standing/sitting or violent behavior) and then counts the number of people in each category.

Parametric Manifold Learning of Gaussian Mixture Models (PRIMAL-GMM) Toolbox

This is a python toolbox learning parametric manifolds of Gaussian mixture models (GMMs).

Eye Movement analysis with Switching HMMs (EMSHMM) Toolbox

This is a MATLAB toolbox for analyzing eye movement data using switching hidden Markov models (SHMMs), for analyzing eye movement data in cognitive tasks involving cognitive state changes. It includes code for learning SHMMs for individuals, as well as analyzing the results.

EgoDaily – Egocentric dataset for Hand Disambiguation

Egocentric hand detection dataset with variability on people, activities and places, to simulate daily life situations.