2024
- Robust Zero-Shot Crowd Counting and Localization with Adaptive Resolution SAM.
,
In: European Conference on Computer Vision (ECCV), Milano, Oct 2024. - A Secure Image Watermarking Framework with Statistical Guarantees via Adversarial Attacks on Secret Key Networks.
,
In: European Conference on Computer Vision (ECCV), Milano, Oct 2024. - Boosting 3D Single Object Tracking with 2D Matching Distillation and 3D Pre-training.
,
In: European Conference on Computer Vision (ECCV), Milano, Oct 2024. - Mahalanobis Distance-based Multi-view Optimal Transport for Multi-view Crowd Localization.
,
In: European Conference on Computer Vision (ECCV), Milano, Oct 2024. [Project&Code] - FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models.
,
In: European Conference on Computer Vision (ECCV), Milano, Oct 2024. - Human attention guided explainable artificial intelligence for computer vision models.
,
Neural Networks, 177:106392, Sep 2024. - Edit Temporal-Consistent Videos with Image Diffusion Model.
,
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), to appear 2024. - Group-based Distinctive Image Captioning with Memory Difference Encoding and Attention.
,
International Journal of Computer Vision (IJCV), to appear 2024. - Gradient-based Visual Explanation for Transformer-based CLIP.
,
In: International Conference on Machine Learning (ICML), Vienna, Jul 2024. - The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks.
,
In: International Conference on Machine Learning (ICML), Vienna, Jul 2024. - Is Holistic Processing Associated with Face Scanning Pattern and Performance in Face Recognition? Evidence from Deep Neural Network with Hidden Markov Modeling.
,
In: Annual Conference of the Cognitive Science Society (CogSci), Rotterdam, Jul 2024. - Eye Movement Behavior during Mind Wandering across Different Tasks in Interactive Online Learning.
,
In: Annual Conference of the Cognitive Science Society (CogSci), Rotterdam, Jul 2024. - Do large language models resolve semantic ambiguities in the same way as humans? The case of word segmentation in Chinese sentence reading.
,
In: Annual Conference of the Cognitive Science Society (CogSci), Rotterdam, Jul 2024. - Demystify Deep-learning AI for Object Detection using Human Attention Data.
,
In: Annual Conference of the Cognitive Science Society (CogSci), Rotterdam, Jul 2024. - Affecting Audience Valence and Arousal in 360 Immersive Environments: How Powerful Neural Style Transfer Is?
,
In: HCI International 2024 Conference (HCII2024) - Virtual, Augmented, and Mixed Reality, Washington DC, Jun 2024. - Understanding and Fighting Scams: Media, Language, Appeals and Effects.
,
In: HCI International 2024 Conference (HCII2024) - Late Breaking Papers, Washington DC, Jun 2024. - Learning Tracking Representations from Single Point Annotations.
,
In: CVPR Workshop on Learning With Limited Labelled Data for Image and Video Understanding (L3D-IVU), Jun 2024. - Another Perspective of Over-Smoothing: Alleviating Semantic Over-Smoothing in Deep GNNs.
,
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), to appear (online May 2024). - Generalized Characteristic Function Loss for Crowd Analysis in the Frequency Domain.
,
IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), 46(5):2882-2899, May 2024 (online Nov 2023). [supplemental] - Gradient-based Instance-Specific Visual Explanations for Object Specification and Object Discrimination.
,
IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), to appear 2024. - A Fixed-Point Approach to Unified Prompt-Based Counting.
,
In: AAAI Conference on Artificial Intelligence (AAAI), Vancouver, Feb 2024. [supplemental | code] - Multi-view People Detection in Large Scenes via Supervised View-wise Contribution Weighting.
,
In: AAAI Conference on Artificial Intelligence (AAAI), Feb 2024. [Project&Code]
2023
- Retrieval-Augmented Multiple Instance Learning.
,
In: Neural Information Processing Systems (NeurIPS), New Orleans, Dec 2023. - Single-Frame-Based Deep View Synchronization for Unsynchronized Multicamera Surveillance.
,
IEEE Trans. on Neural Networks and Learning Systems (TNNLS), 34(12):10653-10667, Dec 2023. [code&dataset] - Towards the next generation explainable AI that promotes AI-human mutual understanding.
,
In: NeurIPS workshop on XAI in Action: Past, Present, and Future Applications, New Orleans, Dec 2023. - Modeling Noisy Annotations for Point-Wise Supervision.
,
IEEE Trans. Pattern Analysis and Machine Intelligence (TPAMI), 45(12):15065-15080, Dec 2023 (online Jul 2023). - Scalable Video Object Segmentation with Simplified Framework.
,
In: International Conf. Computer Vision (ICCV), Paris, Oct 2023. [code] - Variational Nested Dropout.
,
IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), 45(8):10519-10534, Aug 2023 (online Feb 2023). - Human Attention-Guided Explainable AI for Object Detection.
,
In: Annual Conference of the Cognitive Science Society, July 2023. [arXiv] - Use of online therapy session data to develop behavioural markers for cognitive outcomes in non-pharmacological intervention.
,
In: Alzheimer's Association International Conference (AAIC), Amsterdam, July 2023. - TWINS: A Fine-Tuning Framework for Improved Transferability of Adversarial Robustness and Generalization.
,
In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2023. - Optimal Transport Minimization: Crowd Localization on Density Maps for Semi-Supervised Counting.
,
In: IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), Jun 2023 (highlight). [code] - DropMAE: Masked Autoencoders with Spatial-Attention Dropout for Tracking Tasks.
,
In: IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), Jun 2023. [code] - Pareto Optimization for Active Learning under Out-of-Distribution Data Scenarios.
,
Transactions on Machine Learning Research (TMLR), June 2023. - ODAM: Gradient-based Instance-Specific Visual Explanations for Object Detection.
,
In: Intl. Conf. on Learning Representations (ICLR), Rwanda, May 2023. [code] - Bayes-MIL: A New Probabilistic Perspective on Attention-based Multiple Instance Learning for Whole Slide Images.
,
In: Intl. Conf. on Learning Representations (ICLR), Rwanda, May 2023. - A Lightweight and Detector-Free 3D Single Object Tracker on Point Clouds.
,
IEEE Trans. on Intelligent Transportation Systems, 24(5):5543-5554, May 2023. - Visual attention to own- vs. other-race faces: Perspectives from learning mechanisms and task demands.
,
British Journal of Psychology, 114(S1):17-20, May 2023. - Clustering Hidden Markov Models With Variational Bayesian Hierarchical EM.
,
IEEE Trans. on Neural Networks and Learning Systems (TNNLS), 34(3):1537-1551, March 2023 (online 2021). - On Distinctive Image Captioning via Comparing and Reweighting.
,
IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), 45(2):2088-2103, Feb 2023 (online 2022). - On Becoming Socially Anxious: Toddlers’ Attention Bias to Fearful Faces.
,
Developmental Psychology, 59(2):353-363, Feb 2023. - Optimal planning of municipal-scale distributed rooftop photovoltaic systems with maximized solar energy generation under constraints in high-density cities.
,
Energy, 263(Part A):125686, Jan 2023.
2022
- 3D Crowd Counting via Geometric Attention-guided Multi-View Fusion.
,
International Journal of Computer Vision (IJCV), 130:3123-3139, Dec 2022. - RegGeoNet: Learning Regular Representations for Large-Scale 3D Point Clouds.
,
International Journal of Computer Vision (IJCV), 130:3100-3122, Dec 2022. - Improved Fine-Tuning by Better Leveraging Pre-Training Data.
,
In: Neural Information Processing Systems (NeurIPS), Nov 2022. - An Empirical Study on Distribution Shift Robustness From the Perspective of Pre-Training and Data Augmentation.
,
In: NeurIPS 2022 Workshop on Distribution Shifts: Connecting Methods and Applications (DistShift), Nov 2022. - Precise Augmentation and Counting of Helicobacter Pylori in Histology Image.
,
In: NeurIPS 2022 Workshop on Medical Imaging meets NeurIPS (MedNeurIPS), Nov 2022. - Boosting Adversarial Robustness From The Perspective of Effective Margin Regularization.
,
In: British Machine Vision Conference, Nov 2022. - Scale-Prior Deformable Convolution for Exemplar-Guided Class-Agnostic Counting.
,
In: British Machine Vision Conference, Nov 2022. - Bits-Ensemble: Towards Light-Weight Robust Deep Ensemble by Bits-Sharing.
,
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 41(11):4397-4408, Nov 2022 (CASES 2022). - Understanding the role of eye movement consistency in face recognition and autism through integrating deep neural networks and hidden Markov models.
,
npj Science of Learning, 7:28, Oct 2022. - Calibration-free Multi-view Crowd Counting.
,
In: European Conference on Computer Vision (ECCV), Tel Aviv, Oct 2022. [supplemental] - Understanding children's attention to traumatic dental injuries using eye-tracking.
,
Dental Traumatology, 38(5):410-416, Oct 2022. - Wide-Area Crowd Counting: Multi-View Fusion Networks for Counting in Large Scenes.
,
International Journal of Computer Vision (IJCV), 130(8):1938-1960, Aug 2022. - Asymptotic Optimality for Active Learning Processes.
,
In: Uncertainty in Artificial Intelligence (UAI), Aug 2022. [supplemental] - PRIMAL-GMM: PaRametrIc MAnifold Learning of Gaussian Mixture Models.
,
IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), 44(6):3197-3211, June 2022 (online 2021). [code] - Crowd Counting in the Frequency Domain.
,
In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2022. - Understanding children’s attention to dental caries through eye-tracking.
,
Caries Research, 56(2):129-137, June 2022. - Eye movement analysis of children's attention for midline diastema.
,
Scientific Reports, 12:7462, May 2022. - Accelerating Monte Carlo Bayesian Prediction via Approximating Predictive Uncertainty over the Simplex.
,
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 33(4):1492-1506, Apr 2022 (online 2020). - System and Method for Optimizing A User Interface and A System and Method for Manipulating A User's Interaction with An Interface.
,
US Patent US11275596B2, March 2022. - Kernel-based Density Map Generation for Dense Object Counting.
,
IEEE Trans. Pattern Analysis and Machine Intelligence (TPAMI), 44(3):1357-1370, Mar 2022. - On Diversity in Image Captioning: Metrics and Methods.
,
IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), 44(2):1035-1049, Feb 2022. - The effects of attentional and interpretation biases on later pain outcomes among younger and older adults: A prospective study.
,
European Journal of Pain, 26(1):181-196, Jan 2022.
2021
- Chinese White Dolphin Detection in the Wild.
,
In: ACM Multimedia Asia (MMAsia), Gold Coast, Australia, Dec 2021. - Understanding the collinear masking effect in visual search through eye tracking.
,
Psychonomic Bulliten & Review, 28(6):1933-1943, Dec 2021. - Dynamic Momentum Adaptation for Zero-Shot Cross-Domain Crowd Counting.
,
In: ACM Multimedia (MM), Oct 2021. - Group-based Distinctive Image Captioning with Memory Attention.
,
In: ACM Multimedia (MM), Oct 2021 (oral). [supplemental] - BEV-Net: Assessing Social Distancing Compliance by Joint People Localization and Geometric Reasoning.
,
In: Intl. Conference on Computer Vision (ICCV), Oct 2021. - A Comparative Survey: Benchmarking for Pool-based Active Learning.
,
In: International Joint Conf. on Artificial Intelligence (IJCAI), Survey Track, Aug 2021. - Hierarchical Learning of Hidden Markov Models with Clustering Regularization.
,
In: 37th Conference on Uncertainty in Artificial Intelligence (UAI), Jul 2021. - Multiple-criteria Based Active Learning with Fixed-size Determinantal Point Processes.
,
In: Subset Selection in Machine Learning: From Theory to Applications, ICML Workshop, July 2021. - Improve Generalization and Robustness of Neural Networks via Weight Scale Shifting Invariant Regularizations.
,
In: ICML workshop on Adversarial Machine Learning, July 2021. - Meta-Graph Adaptation for Visual Object Tracking.
,
In: IEEE International Conference on Multimedia and Expo (ICME), Jul 2021 (oral). - A Generalized Loss Function for Crowd Counting and Localization.
,
In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2021. [supplemental] - Cross-View Cross-Scene Multi-View Crowd Counting.
,
In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR):557-567, Jun 2021. [supplemental | dataset] - Progressive Unsupervised Learning for Visual Object Tracking.
,
In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2021 (oral). [supplemental] - Bayesian Nested Neural Networks for Uncertainty Calibration and Adaptive Compression.
,
In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2021. - Do portrait artists have enhanced face processing abilities? Evidence from hidden Markov modeling of eye movements.
,
Cognition, 211(104616), June 2021. - Hand disambiguation using attention neural networks in the egocentric perspective.
,
In: International Conference on Digital Image Processing (ICDIP), May 2021. - Eye Movement analysis with Hidden Markov Models (EMHMM) with co-clustering.
,
Behavior Research Methods, 53:2473-2486, April 2021. - Angular-Driven Feedback Restoration Networks for Imperfect Sketch Recognition.
,
IEEE Trans. on Image Processing (TIP), 30:5085-5095, Apr 2021. - Applying the Hidden Markov Model to Analyze Urban Mobility Patterns: An Interdisciplinary Approach.
,
Chinese Geographical Science, 31(1):1-13, Feb 2021. - Fine-Grained Crowd Counting.
,
IEEE Trans. on Image Processing (TIP), 30:2114-2126, Jan 2021. [code | data] - Tracking-by-Counting: Using Network Flows on Crowd Density Maps for Tracking Multiple Targets.
,
IEEE Trans. on Image Processing (TIP), 30:1439-1452, 2021. - Visual Tracking via Dynamic Memory Networks.
,
IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), 43(1):360-374, Jan 2021. [code]
2020
- Modeling Noisy Annotations for Crowd Counting.
,
In: Neural Information Processing Systems (NeurIPS), Dec 2020. [supplemental] - Understanding visual attention to face emotions in social anxiety using hidden Markov models.
,
Cognition and Emotion, 34(8):1704-1710, Dec 2020. - The interrelation between interpretation biases, threat expectancies and pain-related attentional processing.
,
European Journal of Pain, 24(10):1956-1967, Nov 2020. - Neighbours Matter: Image Captioning with Similar Images.
,
In: British Machine Vision Conference (BMVC), Sep 2020. - Compare and Reweight: Distinctive Image Captioning Using Similar Images Sets.
,
In: European Conference on Computer Vision (ECCV), Aug 2020 (oral). - Interpretation biases and visual attention in the processing of ambiguous information in chronic pain.
,
European Journal of Pain, 24(7):1242-1256, Aug 2020. - Fully Nested Neural Network for Adaptive Compression and Quantization.
,
In: International Joint Conf. on Artificial Intelligence (IJCAI), Yokohama, July 2020. [supplemental] - Incorporating Side Information by Adaptive Convolution.
,
International Journal of Computer Vision (IJCV), 128:2897-2918, July 2020. - The role of eye movement consistency in learning to recognise faces: Computational and experimental examinations.
,
In: 42nd Annual Conference of the Cognitive Science Society (CogSci), Jul 2020. - ROAM: Recurrently Optimizing Tracking Model.
,
In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, Jun 2020. [code] - Eye movement analysis with switching hidden Markov models.
,
Behavior Research Methods, 52:1026-1043, June 2020. [appendix] - 3D Crowd Counting via Multi-View Fusion with 3D Gaussian Kernels.
,
In: AAAI Conference on Artificial Intelligence, AAAI, New York, Feb 2020. [supplemental]
2019
- Is that my hand? An egocentric dataset for hand disambiguation.
,
Image and Vision Computing, 89:131-143, Sept 2019. [dataset] - Adaptive Density Map Generation for Crowd Counting.
,
In: Intl. Conf. on Computer Vision (ICCV), Seoul, Oct 2019. - Hand Detection using Zoomed Neural Networks.
,
In: Intl. Conf. on Image Analysis and Processing (ICIAP), Trento, Sep 2019. - Parametric Manifold Learning of Gaussian Mixture Models.
,
In: International Joint Conference on Artificial Intelligence (IJCAI), Macau, Aug 2019. - Understanding Individual Differences in Eye Movement Pattern During Scene Perception through Co-Clustering of Hidden Markov Models.
,
In: The Annual Meeting of the Cognitive Science Society (CogSci), Montreal, Jul 2019. - ButtonTips: Designing Web Buttons with Suggestions.
,
In: IEEE International Conference on Multimedia and Expo (ICME), Shanghai, Jul 2019. - Describing like Humans: on Diversity in Image Captioning.
,
In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , Long Beach, June 2019. [code] - Residual Regression with Semantic Prior for Crowd Counting.
,
In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , Long Beach, June 2019. [code] - Wide-Area Crowd Counting via Ground-Plane Density Maps and Multi-View Fusion CNNs.
,
In: IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), Long Beach, June 2019. [dataset&code] - Density-Preserving Hierarchical EM Algorithm: Simplifying Gaussian Mixture Models for Approximate Inference.
,
IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), 41(6):1323-1337, June 2019. - Beyond Counting: Comparisons of Density Maps for Crowd Analysis Tasks - Counting, Detection, and Tracking.
,
IEEE Trans. on Circuits and Systems for Video Technology (TCSVT), 29(5):1408-1422, May 2019. - Hidden Markov modelling of eye movements in social anxiety: a data-driven machine-learning approach to eye-tracking research in psychopathology.
,
In: 2019 Anxiety & Depression Conference, Chicago, March 2019. - Individuals with Insomnia Misrecognize Angry Faces as Fearful Faces While Missing the Eyes: An Eye-Tracking Study.
,
SLEEP, 42(2), zsy220, Feb 2019. - A Fully Bayesian Infinite Generative Model for Dynamic Texture Segmentation.
,
arXiv:1901.03968, Jan 2019.
2018
- Eye Movement Patterns in Face Recognition are Associated with Cognitive Decline in Older Adults.
,
Psychonomic Bulletin & Review, 25(6):2200-2207, Dec 2018. - Hand detection using deformable part models on an egocentric perspective.
,
In: Digital Image Computing: Techniques and Applications (DICTA), Canberra, Dec 2018. - Gated Hierarchical Attention for Image Captioning.
,
In: Asian Conference on Computer Vision (ACCV), Perth, Dec 2018. [code] - Learning Dynamic Memory Networks for Object Tracking.
,
In: European Conference on Computer Vision (ECCV), Munich, Sept 2018. [code] - Crowd Counting by Adaptively Fusing Predictions from an Image Pyramid.
,
In: British Machine Vision Conference (BMVC), Newcastle, Sept 2018. [supplemental] - Music Reading Expertise Facilitates English but not Chinese sentence reading: Evidence from Eye Movement Behavior.
,
In: 15th International Conference on Music Perception and Cognition (ICMPC15), Sydney, July 2018. - Fusing Crowd Density Maps and Visual Object Trackers for People Tracking in Crowd Scenes.
,
In: IEEE Conf. Computer Vision and Pattern Recognition (CVPR), Salt Lake City, Jun 2018. - CNN+CNN: Convolutional Decoders for Image Captioning.
,
In: IEEE Computer Vision and Pattern Recognition: Language and Vision Workshop, Salt Lake City, Jun 2018. [code] - Color Orchestra: Ordering Color Palettes for Interpolation and Prediction.
,
IEEE Transactions on Visualization and Computer Graphics (TVCG), 24(6):1942-1955, June 2018. - Scanpath modeling and classification with Hidden Markov Models.
,
Behavior Research Methods, 50:362-379, Feb 2018.
2017
- Incorporating Side Information by Adaptive Convolution.
,
In: Neural Information Processing Systems, Long Beach, Dec 2017. [supplemental] - Is having similar eye movement patterns during face learning and recognition beneficial for recognition performance? Evidence from hidden Markov modeling.
,
Vision Research, 141:204-216, Dec 2017. - Hidden Markov model analysis reveals the advantage of analytic eye movement patterns in face recognition across cultures.
,
Cognition, 169:102-117, Dec 2017. - Mining Probabilistic Color Palettes for Summarizing Color Use in Artwork Collections.
,
In: Symposium on Visualization, SIGGRAPH Asia 2017, Bangkok, Nov 2017. [supplemental] - Recurrent filter learning for visual tracking.
,
In: ICCV 5th Visual Object Tracking Challenge Workshop VOT2017, Venice, Oct 2017. [video] - Insomniacs Misidentify Angry Faces as Fearful Faces Because of Missing the Eyes: an Eye-Tracking Study.
,
In: The Annual Meeting of the Cognitive Science Society (CogSci), London, July 2017. - Video Desnowing and Deraining Based on Matrix Decomposition.
,
In: IEEE Conf. Computer Vision and Pattern Recognition (CVPR), Honolulu, Jul 2017. - Martial Arts, Dancing and Sports Dataset: a Challenging Stereo and Multi-View Dataset for 3D Human Pose Estimation.
,
Image and Vision Computing, 61:22-39, May 2017. [supplemental] - Learning word embeddings via context grouping.
,
In: ACM Turing 50th Celebration Conference - China, Shanghai, May 2017. - Efficient tree-structured SfM by RANSAC generalized Procrustes analysis.
,
Computer Vision and Image Understanding (CVIU), 157:179-189, April 2017. - Maximum-Margin Structured Learning with Deep Networks for 3D Human Pose Estimation.
,
International Journal of Computer Vision (IJCV), 122(1):149-168, March 2017. - DynamicManga: Animating Still Manga via Camera Movement.
,
IEEE Trans. on Multimedia (TMM), 19(1):160-172, Jan 2017. [supplemental | video]
2016
- Approximate Inference for Generic Likelihoods via Density-Preserving GMM Simplification.
,
In: NIPS 2016 Workshop on Advances in Approximate Bayesian Inference, Barcelona, Dec 2016. - Directing User Attention via Visual Flow on Web Designs.
,
ACM Transactions on Graphics (Proc. SIGGRAPH Asia 2016), Dec 2016. [supplemental | video] - Information Distribution within Musical Segments.
,
Music Perception: An Interdisciplinary Journal, 34:218-242, Dec 2016. - Counting People Crossing a Line using Integer Programming and Local Features.
,
IEEE Trans. on Circuits and Systems for Video Technology (TCSVT), 26(10):1955-1969, Oct 2016. [appendix] - Mind reading: Discovering individual preferences from eye movements using switching hidden Markov models.
,
In: The Annual Meeting of the Cognitive Science Society (CogSci), Philadelphia, Aug 2016. - Analytic Eye Movement Patterns in Face Recognition are Associated with Better Performance and more Top-down Control of Visual Attention: an fMRI Study.
,
In: The Annual Meeting of the Cognitive Science Society (CogSci), Philadelphia, Aug 2016. - Hidden Markov Modeling of eye movements with image information lead to better discovery of regions of interest.
,
In: The Annual Meeting of the Cognitive Science Society (CogSci), Philadelphia, Aug 2016. - Patternista: Learning Element Style Compatibility and Spatial Composition for Ring-based Layout Decoration.
,
In: Non-Photorealistic Animation and Rendering (Expressive) 2016, Lisbon, May 2016.
2015
- Maximum-Margin Structured Learning with Deep Networks for 3D Human Pose Estimation.
,
In: Intl. Conf. on Computer Vision (ICCV):2848-2856, Santiago, Dec 2015. [spotlight video] - FlexyFont: Learning Transferring Rules for Flexible Typeface Synthesis.
,
Computer Graphics Forum (Proc. Pacific Graphics 2015), 34(7), Oct 2015. [video] - Hidden Markov model analysis reveals better eye movement strategies in face recognition.
,
In: The Annual Meeting of the Cognitive Science Society (CogSci):393-398, Pasadena, Jul 2015. - Eye Movement Pattern in Face Recognition is Associated with Cognitive Decline in the Elderly.
,
In: The Annual Meeting of the Cognitive Science Society (CogSci):321-326, Pasadena, Jul 2015. - Small Instance Detection by Integer Programming on Object Density Maps.
,
In: IEEE Conf. Computer Vision and Pattern Recognition (CVPR), Boston, Jun 2015. [extended abstract] - Global and Local Priming Evoke Different Face Processing Strategies: Evidence From An Eye Movement Study.
,
In: Annual meeting of the Vision Sciences Society (VSS), St. Pete Beach, Florida, May 2015. - Heterogeneous Multi-task Learning for Human Pose Estimation with Deep Convolutional Neural Network.
,
International Journal of Computer Vision (IJCV), 113(1):19-36, May 2015. - An SVD-based Multimodal Clustering Method for Social Event Detection.
,
In: ICDE 2015 Workshop on Scalable Social Event Processing and Management (SSEPM), Seoul, Korea, Apr 2015. - A Scalable and Accurate Descriptor for Dynamic Textures using Bag of System Trees.
,
IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), 37(4):697-712, Apr 2015. [appendix] - Leveraging Long-Term Predictions and Online Learning in Agent-Based Multiple Person Tracking.
,
IEEE Trans. on Circuits and Systems for Video Technology (TCSVT), 25(3):399-410, Mar 2015. - Enhanced Figure-Ground Classification with Background Prior Propagation.
,
IEEE Trans. on Image Processing (TIP), 24(3):873-885, Mar 2015.
2014
- A Robust Likelihood Function for 3D Human Pose Tracking.
,
IEEE Trans. on Image Processing (TIP), 23(12):5374-5389, Dec 2014. - 3D Human Pose Estimation from Monocular Images with Deep Convolutional Neural Network.
,
In: Asian Conference on Computer Vision (ACCV), Singapore, Nov 2014. - Look Closely: Learning Exemplar Patches for Recognizing Textiles from Product Images.
,
In: Asian Conference on Computer Vision (ACCV), Singapore, Nov 2014. - Understanding eye movements in face recognition using hidden Markov models.
,
Journal of Vision, 14(11):8, Sep 2014. - Look Over Here: Attention-Directing Composition of Manga Elements.
,
ACM Transactions on Graphics (Proc. SIGGRAPH 2014), Aug 2014. [supplemental | video | slides] - Heterogeneous Multi-task Learning for Human Pose Estimation with Deep Convolutional Neural Network.
,
In: IEEE Conf. Computer Vision and Pattern Recognition: DeepVision Workshop, Columbus, Jun 2014. - Joint Motion Segmentation and Background Estimation in Dynamic Scenes.
,
In: IEEE Conf. Computer Vision and Pattern Recognition (CVPR), Columbus, Jun 2014. [spotlight video] - Caucasian and Asian eye movement patterns in face recognition: A computational exploration using hidden Markov models.
,
In: Annual meeting of the Vision Sciences Society (VSS), May 2014. - Understanding eye movements in face recognition with hidden Markov model.
,
In: 6th Chinese International Conference on Eye Movements (CICEM), Beijing, May 2014 (oral). - Clustering hidden Markov models with variational HEM.
,
Journal of Machine Learning Research (JMLR), 15(2):697-747, Feb 2014. [code]
2013
- Surveillance of Crowded Environments: Modeling the Crowd by its Global Properties.
,
In Modeling, Simulation and Visual Analysis of Crowds: A Multidisciplinary Perspective, Springer, New York, Dec 2013. - Objective Measures of IS Usage Behavior Under Conditions of Experience and Pressure Using Eye Fixation Data.
,
In: International Conference on Information Systems (ICIS), Milan, Dec 2013. - A Bag of Systems Representation for Music Auto-tagging.
,
IEEE Trans. on Audio, Speech and Language Processing (TASLP), 21(12):2554-2569, Dec 2013. - Understanding eye movements in face recognition with hidden Markov model.
,
In: The Annual Meeting of the Cognitive Science Society (CogSci), Berlin, Aug 2013 (oral, Student travel award winner). - Clustering Dynamic Textures with the Hierarchical EM Algorithm for Modeling Video.
,
IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), 35(7):1606-1621, Jul 2013. [appendix] - That was fast! Speeding up NN search of high dimensional distributions.
,
In: International Conference on Machine Learning (ICML), Atlanta, Jun 2013. [supplemental | code] - Crossing the Line: Crowd Counting by Integer Programming with Local Features.
,
In: IEEE Conf. Computer Vision and Pattern Recognition (CVPR), Portland, Jun 2013.
2012
- The variational hierarchical EM algorithm for clustering hidden Markov models.
,
In: Neural Information Processing Systems (NIPS), Lake Tahoe, Dec 2012. - Automatic Stylistic Manga Layout.
,
ACM Transactions on Graphics (Proc. SIGGRAPH Asia 2012), Singapore, Nov 2012. [supplemental | video | slides] - Multivariate autoregressive mixture models for music auto-tagging.
,
In: International Society for Music Information Retrieval Conference (ISMIR), Porto, Oct 2012. - Growing a Bag of Systems Tree for Fast and Accurate Classification.
,
In: IEEE Conf. Computer Vision and Pattern Recognition (CVPR), Providence, Jun 2012. - Adaptive Figure-Ground Classification.
,
In: IEEE Conf. Computer Vision and Pattern Recognition (CVPR), Providence, Jun 2012. - Counting People with Low-Level Features and Bayesian Regression.
,
IEEE Trans. on Image Processing (TIP), 21(4):2170-2177, May 2012.
2011
- Generalized Stauffer-Grimson background subtraction for dynamic scenes.
,
Machine Vision and Applications, 22(5):751-766, Sep 2011. - Time Series Models for Semantic Music Annotation.
,
IEEE Trans. on Audio, Speech and Language Processing (TASLP), 19(5):1343-1359, Jul 2011. - Generalized Gaussian Process Models.
,
In: IEEE Conf. Computer Vision and Pattern Recognition (CVPR), Colorado Springs, Jun 2011. [supplemental] - Genre Classification and the Invariance of MFCC Features to Key and Tempo.
,
In: Intl. Conference on MultiMedia Modeling (MMM), Taipei, Jan 2011.
2010
- Automatic music tagging with time series models.
,
In: International Society for Music Information Retrieval Conference (ISMIR), Utrecht, Aug 2010. - Clustering Dynamic Textures with the Hierarchical EM Algorithm.
,
In: IEEE Conf. Computer Vision and Pattern Recognition (CVPR), San Francisco, Jun 2010. [supplemental] - Modeling music as a dynamic texture.
,
IEEE Trans. on Audio, Speech and Language Processing (TASLP), 18(3):602-612, Mar 2010. - Automatic Musical Pattern Feature Extraction Using Convolutional Neural Network.
,
In: Intl. MultiConference of Engineers and Computer Scientists (IMECS), Hong Kong, Mar 2010.
2009
- Layered dynamic textures.
,
IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), 31(10):1862-1879, Oct 2009. - Bayesian Poisson Regression for Crowd Counting.
,
In: IEEE Intl Conf. on Computer Vision (ICCV), Kyoto, Sep 2009. - Analysis of Crowded Scenes using Holistic Properties.
,
In: 11th IEEE Intl. Workshop on Performance Evaluation of Tracking and Surveillance (PETS 2009), Miami, Jun 2009. - Variational Layered Dynamic Textures.
,
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Miami, Jun 2009. - Derivations for the Layered Dynamic Texture and Temporally-Switching Layered Dynamic Texture.
,
Technical Report SVCL-TR-2009-01, Jun 2009. - Dynamic Texture Models of Music.
,
In: Intl. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), Taipei, Apr 2009.
2008
- Beyond Dynamic Textures: a Family of Stochastic Dynamical Models for Video with Applications to Computer Vision.
,
Ph.D. Thesis, University of California San Diego, Dec 2008. [supplemental] - Privacy Preserving Crowd Monitoring: Counting People without People Models or Tracking.
,
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Anchorage, Jun 2008. - Modeling, clustering, and segmenting video with mixtures of dynamic textures.
,
IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), 30(5):909-926, May 2008.
2007
- Direct Convex Relaxations of Sparse SVM.
,
In: International Conference on Machine Learning (ICML), Corvallis, Jun 2007. [old version using partial wine dataset] - Classifying Video with Kernel Dynamic Textures.
,
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Minneapolis, Jun 2007. - Supplemental for "Classifying Video with Kernel Dynamic Textures".
,
Technical Report SVCL-TR-2007-03, Apr 2007. [video] - Audio Information Retrieval using Semantic Similarity.
,
In: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Honolulu, Apr 2007. - Supervised learning of semantic classes for image annotation and retrieval.
,
IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), 29(3):394-410, Mar 2007.
2006
- Using Statistics to Search and Annotate Pictures: an Evaluation of Semantic Image Annotation and Retrieval on Large Databases.
,
In: Proceedings of the American Statistical Association, Seattle, Aug 2006. - On Measuring the Change in Size of Pulmonary Nodules.
,
IEEE Transactions on Medical Imaging (TMI), 25(4):435-450, Apr 2006.
2005
- Layered Dynamic Textures.
,
In: Neural Information Processing Systems 18 (NIPS), Vancouver, Dec 2005. - Mixtures of Dynamic Textures.
,
In: IEEE International Conference on Computer Vision (ICCV), Beijing, Oct 2005. - Classification and Retrieval of Traffic Video using Auto-regressive Stochastic Processes.
,
In: 2005 IEEE Intelligent Vehicles Symposium (IEEEIV), Las Vegas, Jun 2005. - Probabilistic Kernels for the Classification of Auto-regressive Visual Processes.
,
In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), San Diego, Jun 2005. [8-page version] - The EM Algorithm for Mixtures of Dynamic Textures.
,
Technical Report SVCL-TR-2005-02, Jan 2005. [see 2008 TPAMI paper]
2004
- Efficient Computation of the KL Divergence between Dynamic Textures.
,
Technical Report SVCL-TR-2004-02, Nov 2004. [a more efficient algorithm is discussed in my thesis] - A Family of Probabilistic Kernels Based on Information Divergence.
,
Technical Report SVCL-TR-2004-01, Jun 2004.
2003
- System, method and apparatus for small pulmonary nodule computer aided diagnosis from computed tomography scans.
,
US Patents 7,499,578 B2 (2009) and 7,751,607 B2 (2010).
External links: Google Scholar | Microsoft Academic | orcid.org/0000-0002-2886-2513 | Scopus ID: 14015159100
IEEE Copyright Notice
©IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
©IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
UC Press copying and permissions notice: ©2017 by the Regents of the University of California. Authorization to copy this content beyond fair use (as specified in Sections 107 and 108 of the U. S. Copyright Law) for internal or personal use, or the internal or personal use of specific clients, is granted by [the Regents of the University of California/on behalf of the Sponsoring Society] for libraries and other users, provided that they are registered with and pay the specified fee via Rightslink® or directly with the Copyright Clearance Center.