Showing 1–50 of 74 results for author: Ho, L

  1. arXiv:2408.00307  [pdf, other]

    cs.LG cs.AI cs.CL

    ABC Align: Large Language Model Alignment for Safety & Accuracy

    Authors: Gareth Seneque, Lap-Hang Ho, Ariel Kuperman, Nafise Erfanian Saeedi, Jeffrey Molendijk

    Abstract: Alignment of Large Language Models (LLMs) remains an unsolved problem. Human preferences are highly distributed and can be captured at multiple levels of abstraction, from the individual to diverse populations. Organisational preferences, represented by standards and principles, are defined to mitigate reputational risk or meet legislative obligations. In this paper, we present ABC Align, a novel…

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: 23 pages, 4 figures

    MSC Class: 68T50 ACM Class: I.2.7

  2. arXiv:2407.00805  [pdf, other]

    cs.AI

    Towards shutdownable agents via stochastic choice

    Authors: Elliott Thornley, Alexander Roman, Christos Ziakas, Leyton Ho, Louis Thomson

    Abstract: Some worry that advanced artificial agents may resist being shut down. The Incomplete Preferences Proposal (IPP) is an idea for ensuring that doesn't happen. A key part of the IPP is using a novel 'Discounted REward for Same-Length Trajectories (DREST)' reward function to train agents to (1) pursue goals effectively conditional on each trajectory-length (be 'USEFUL'), and (2) choose stochastically…

    Submitted 30 June, 2024; originally announced July 2024.

  3. arXiv:2406.18691  [pdf, other]

    cs.CV

    Geometric Features Enhanced Human-Object Interaction Detection

    Authors: Manli Zhu, Edmond S. L. Ho, Shuang Chen, Longzhi Yang, Hubert P. H. Shum

    Abstract: Cameras are essential vision instruments to capture images for pattern detection and measurement. Human-object interaction (HOI) detection is one of the most popular pattern detection approaches for captured human-centric visual scenes. Recently, Transformer-based models have become the dominant approach for HOI detection due to their advanced network architectures and thus promising results. Howe…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted to IEEE TIM

  4. arXiv:2406.14988  [pdf]

    eess.IV cs.AI

    Introducing the Biomechanics-Function Relationship in Glaucoma: Improved Visual Field Loss Predictions from intraocular pressure-induced Neural Tissue Strains

    Authors: Thanadet Chuangsuwanich, Monisha E. Nongpiur, Fabian A. Braeu, Tin A. Tun, Alexandre Thiery, Shamira Perera, Ching Lin Ho, Martin Buist, George Barbastathis, Tin Aung, Michaël J. A. Girard

    Abstract: Objective. (1) To assess whether neural tissue structure and biomechanics could predict functional loss in glaucoma; (2) To evaluate the importance of biomechanics in making such predictions. Design, Setting and Participants. We recruited 238 glaucoma subjects. For one eye of each subject, we imaged the optic nerve head (ONH) using spectral-domain OCT under the following conditions: (1) primary ga…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 19 pages, 2 figures

  5. arXiv:2405.16204  [pdf, other]

    cs.CV cs.AI cs.GR

    VOODOO XP: Expressive One-Shot Head Reenactment for VR Telepresence

    Authors: Phong Tran, Egor Zakharov, Long-Nhat Ho, Liwen Hu, Adilbek Karmanov, Aviral Agarwal, McLean Goldwhite, Ariana Bermudez Venegas, Anh Tuan Tran, Hao Li

    Abstract: We introduce VOODOO XP: a 3D-aware one-shot head reenactment method that can generate highly expressive facial expressions from any input driver video and a single 2D portrait. Our solution is real-time, view-consistent, and can be instantly used without calibration or fine-tuning. We demonstrate our solution on a monocular video setting and an end-to-end VR telepresence system for two-way communi…

    Submitted 28 May, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

  6. arXiv:2405.11690  [pdf, other]

    cs.CV

    InterAct: Capture and Modelling of Realistic, Expressive and Interactive Activities between Two Persons in Daily Scenarios

    Authors: Yinghao Huang, Leo Ho, Dafei Qin, Mingyi Shi, Taku Komura

    Abstract: We address the problem of accurate capture and expressive modelling of interactive behaviors happening between two persons in daily scenarios. Different from previous works which either only consider one person or focus on conversational gestures, we propose to simultaneously model the activities of two persons, and target objective-driven, dynamic, and coherent interactions which often span long…

    Submitted 27 May, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

    Comments: The first two authors contributed equally to this work

  7. arXiv:2404.14068  [pdf, other]

    cs.AI cs.LG

    Holistic Safety and Responsibility Evaluations of Advanced AI Models

    Authors: Laura Weidinger, Joslyn Barnhart, Jenny Brennan, Christina Butterfield, Susie Young, Will Hawkins, Lisa Anne Hendricks, Ramona Comanescu, Oscar Chang, Mikel Rodriguez, Jennifer Beroshi, Dawn Bloxwich, Lev Proleev, Jilin Chen, Sebastian Farquhar, Lewis Ho, Iason Gabriel, Allan Dafoe, William Isaac

    Abstract: Safety and responsibility evaluations of advanced AI models are a critical but developing field of research and practice. In the development of Google DeepMind's advanced AI models, we innovated on and applied a broad set of approaches to safety evaluation. In this report, we summarise and share elements of our evolving approach as well as lessons learned for a broad audience. Key lessons learned…

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 10 pages excluding bibliography

  8. arXiv:2404.05490  [pdf, other]

    cs.CV

    Two-Person Interaction Augmentation with Skeleton Priors

    Authors: Baiyi Li, Edmond S. L. Ho, Hubert P. H. Shum, He Wang

    Abstract: Close and continuous interaction with rich contacts is a crucial aspect of human activities (e.g. hugging, dancing) and of interest in many domains like activity recognition, motion prediction, character animation, etc. However, acquiring such skeletal motion is challenging. While direct motion capture is expensive and slow, motion editing/generation is also non-trivial, as complex contact pattern…

    Submitted 9 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  9. arXiv:2403.15605  [pdf, other]

    cs.CV cs.LG

    Efficiently Assemble Normalization Layers and Regularization for Federated Domain Generalization

    Authors: Khiem Le, Long Ho, Cuong Do, Danh Le-Phuoc, Kok-Seng Wong

    Abstract: Domain shift is a formidable issue in Machine Learning that causes a model to suffer from performance degradation when tested on unseen domains. Federated Domain Generalization (FedDG) attempts to train a global model using collaborative clients in a privacy-preserving manner that can generalize well to unseen clients possibly with domain shift. However, most existing FedDG methods either cause ad…

    Submitted 22 March, 2024; originally announced March 2024.

  10. arXiv:2403.13793  [pdf, other]

    cs.LG

    Evaluating Frontier Models for Dangerous Capabilities

    Authors: Mary Phuong, Matthew Aitchison, Elliot Catt, Sarah Cogan, Alexandre Kaskasoli, Victoria Krakovna, David Lindner, Matthew Rahtz, Yannis Assael, Sarah Hodkinson, Heidi Howard, Tom Lieberum, Ramana Kumar, Maria Abi Raad, Albert Webson, Lewis Ho, Sharon Lin, Sebastian Farquhar, Marcus Hutter, Gregoire Deletang, Anian Ruoss, Seliem El-Sayed, Sasha Brown, Anca Dragan, Rohin Shah, et al. (2 additional authors not shown)

    Abstract: To understand the risks posed by a new AI system, we must understand what it can and cannot do. Building on prior work, we introduce a programme of new "dangerous capability" evaluations and pilot them on Gemini 1.0 models. Our evaluations cover four areas: (1) persuasion and deception; (2) cyber-security; (3) self-proliferation; and (4) self-reasoning. We do not find evidence of strong dangerous…

    Submitted 5 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  11. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1110 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 8 August, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  12. arXiv:2312.13776  [pdf, other]

    cs.CV

    Pose-based Tremor Type and Level Analysis for Parkinson's Disease from Video

    Authors: Haozheng Zhang, Edmond S. L. Ho, Xiatian Zhang, Silvia Del Din, Hubert P. H. Shum

    Abstract: Purpose: Current methods for diagnosis of PD rely on clinical examination. The accuracy of diagnosis ranges between 73% and 84%, and is influenced by the experience of the clinical assessor. Hence, an automatic, effective and interpretable supporting system for PD symptom identification would support clinicians in making more robust PD diagnostic decisions. Methods: We propose to analyze Parkinson'…

    Submitted 21 December, 2023; originally announced December 2023.

  13. arXiv:2312.04651  [pdf, other]

    cs.CV

    VOODOO 3D: Volumetric Portrait Disentanglement for One-Shot 3D Head Reenactment

    Authors: Phong Tran, Egor Zakharov, Long-Nhat Ho, Anh Tuan Tran, Liwen Hu, Hao Li

    Abstract: We present a 3D-aware one-shot head reenactment method based on a fully volumetric neural disentanglement framework for source appearance and driver expressions. Our method is real-time and produces high-fidelity and view-consistent output, suitable for 3D teleconferencing systems based on holographic displays. Existing cutting-edge 3D-aware reenactment methods often use neural radiance fields or…

    Submitted 7 December, 2023; originally announced December 2023.

  14. arXiv:2312.00656  [pdf, other]

    cs.LG cs.AI stat.ML

    Simple Transferability Estimation for Regression Tasks

    Authors: Cuong N. Nguyen, Phong Tran, Lam Si Tung Ho, Vu Dinh, Anh T. Tran, Tal Hassner, Cuong V. Nguyen

    Abstract: We consider transferability estimation, the problem of estimating how well deep learning models transfer from a source to a target task. We focus on regression tasks, which received little previous attention, and propose two simple and computationally efficient approaches that estimate transferability based on the negative regularized mean squared error of a linear regression model. We prove novel…

    Submitted 3 December, 2023; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Paper published at The 39th Conference on Uncertainty in Artificial Intelligence (UAI) 2023

  15. arXiv:2311.12355  [pdf, other]

    cs.IR cs.CL cs.LG

    Utilizing Language Models for Tour Itinerary Recommendation

    Authors: Ngai Lam Ho, Kwan Hui Lim

    Abstract: Tour itinerary recommendation involves planning a sequence of relevant Point-of-Interest (POIs), which combines challenges from the fields of both Operations Research (OR) and Recommendation Systems (RS). As an OR problem, there is the need to maximize a certain utility (e.g., popularity of POIs in the tour) while adhering to some constraints (e.g., maximum time for the tour). As a RS problem, it…

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: PMAI23 @IJCAI 2023 2nd International Workshop on Process Management in the AI era

  16. arXiv:2311.11071  [pdf, other]

    cs.IR cs.AI cs.LG cs.SI

    SBTRec- A Transformer Framework for Personalized Tour Recommendation Problem with Sentiment Analysis

    Authors: Ngai Lam Ho, Roy Ka-Wei Lee, Kwan Hui Lim

    Abstract: When traveling to an unfamiliar city for holidays, tourists often rely on guidebooks, travel websites, or recommendation systems to plan their daily itineraries and explore popular points of interest (POIs). However, these approaches may lack optimization in terms of time feasibility, localities, and user preferences. In this paper, we propose the SBTRec algorithm: a BERT-based Trajectory Recommen…

    Submitted 18 November, 2023; originally announced November 2023.

    Report number: 01

  17. arXiv:2310.19886  [pdf]

    cs.LG cs.IR cs.SI

    BTRec: BERT-Based Trajectory Recommendation for Personalized Tours

    Authors: Ngai Lam Ho, Roy Ka-Wei Lee, Kwan Hui Lim

    Abstract: An essential task for tourists having a pleasant holiday is to have a well-planned itinerary with relevant recommendations, especially when visiting unfamiliar cities. Many tour recommendation tools only take into account a limited number of factors, such as popular Points of Interest (POIs) and routing constraints. Consequently, the solutions they provide may not always align with the individual…

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: RecSys 2023, Workshop on Recommenders in Tourism

  18. arXiv:2310.18891  [pdf, other]

    cs.HC cs.CY cs.RO eess.SY

    Social Interaction-Aware Dynamical Models and Decision Making for Autonomous Vehicles

    Authors: Luca Crosato, Kai Tian, Hubert P. H. Shum, Edmond S. L. Ho, Yafei Wang, Chongfeng Wei

    Abstract: Interaction-aware Autonomous Driving (IAAD) is a rapidly growing field of research that focuses on the development of autonomous vehicles (AVs) that are capable of interacting safely and efficiently with human road users. This is a challenging task, as it requires the autonomous vehicle to be able to understand and predict the behaviour of human road users. In this literature review, the current s…

    Submitted 30 October, 2023; v1 submitted 28 October, 2023; originally announced October 2023.

  19. arXiv:2310.05892  [pdf, ps, other]

    stat.ML cs.LG

    A Generalization Bound of Deep Neural Networks for Dependent Data

    Authors: Quan Huu Do, Binh T. Nguyen, Lam Si Tung Ho

    Abstract: Existing generalization bounds for deep neural networks require data to be independent and identically distributed (iid). This assumption may not hold in real-life applications such as evolutionary biology, infectious disease epidemiology, and stock price prediction. This work establishes a generalization bound of feed-forward neural networks for non-stationary $φ$-mixing data.

    Submitted 9 October, 2023; originally announced October 2023.

  20. arXiv:2309.02235  [pdf, other]

    cs.IT

    Experimental Evaluation of Air-to-Ground VHF Band Communication for UAV Relays

    Authors: Boris Galkin, Lester Ho, Ken Lyons, Gokhan Celik, Holger Claussen

    Abstract: Unmanned Aerial Vehicles (UAVs) are a disruptive technology that is transforming a range of industries. Because they operate in the sky, UAVs are able to take advantage of strong Line-of-Sight (LoS) channels for radio propagation, allowing them to communicate over much larger distances than equivalent hardware located at ground level. This has attracted the attention of organisations such as the I…

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: Pre-print of paper presented at the Workshop on Integrating UAVs into 5G and Beyond at IEEE International Conference on Communications 2023

  21. arXiv:2308.15514  [pdf, other]

    cs.AI

    International Governance of Civilian AI: A Jurisdictional Certification Approach

    Authors: Robert Trager, Ben Harack, Anka Reuel, Allison Carnegie, Lennart Heim, Lewis Ho, Sarah Kreps, Ranjit Lall, Owen Larter, Seán Ó hÉigeartaigh, Simon Staffell, José Jaime Villalobos

    Abstract: This report describes trade-offs in the design of international governance arrangements for civilian artificial intelligence (AI) and presents one approach in detail. This approach represents the extension of a standards, licensing, and liability regime to the global level. We propose that states establish an International AI Organization (IAIO) to certify state jurisdictions (not firms or AI proj…

    Submitted 11 September, 2023; v1 submitted 29 August, 2023; originally announced August 2023.

  22. arXiv:2307.04699  [pdf, other]

    cs.CY

    International Institutions for Advanced AI

    Authors: Lewis Ho, Joslyn Barnhart, Robert Trager, Yoshua Bengio, Miles Brundage, Allison Carnegie, Rumman Chowdhury, Allan Dafoe, Gillian Hadfield, Margaret Levi, Duncan Snidal

    Abstract: International institutions may have an important role to play in ensuring advanced AI systems benefit humanity. International collaborations can unlock AI's ability to further sustainable development, and coordination of regulatory efforts can reduce obstacles to innovation and the spread of benefits. Conversely, the potential dangerous capabilities of powerful and general-purpose AI systems creat…

    Submitted 11 July, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

    Comments: 19 pages, 2 figures, fixed rendering issues

    ACM Class: K.4.1

  23. arXiv:2307.03718  [pdf, other]

    cs.CY cs.AI

    Frontier AI Regulation: Managing Emerging Risks to Public Safety

    Authors: Markus Anderljung, Joslyn Barnhart, Anton Korinek, Jade Leung, Cullen O'Keefe, Jess Whittlestone, Shahar Avin, Miles Brundage, Justin Bullock, Duncan Cass-Beggs, Ben Chang, Tantum Collins, Tim Fist, Gillian Hadfield, Alan Hayes, Lewis Ho, Sara Hooker, Eric Horvitz, Noam Kolt, Jonas Schuett, Yonadav Shavit, Divya Siddarth, Robert Trager, Kevin Wolf

    Abstract: Advanced AI models hold the promise of tremendous benefits for humanity, but society needs to proactively manage the accompanying risks. In this paper, we focus on what we term "frontier AI" models: highly capable foundation models that could possess dangerous capabilities sufficient to pose severe risks to public safety. Frontier AI models pose a distinct regulatory challenge: dangerous capabilit…

    Submitted 7 November, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: Update July 11th: - Added missing footnote back in. - Adjusted author order (mistakenly non-alphabetical among the first 6 authors) and adjusted affiliations (Jess Whittlestone's affiliation was mistagged and Gillian Hadfield had SRI added to her affiliations) Updated September 4th: Various typos

  24. arXiv:2306.10994  [pdf, other]

    cs.DB

    Efficient Generalized Temporal Pattern Mining in Big Time Series Using Mutual Information

    Authors: Van Long Ho, Nguyen Ho, Torben Bach Pedersen, Panagiotis Papapetrou

    Abstract: Big time series are increasingly available from an ever wider range of IoT-enabled sensors deployed in various environments. Significant insights can be gained by mining temporal patterns from these time series. Temporal pattern mining (TPM) extends traditional pattern mining by adding event time intervals into extracted patterns, making them more expressive at the expense of increased time and sp…

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: text overlap with arXiv:2010.03653

  25. arXiv:2305.15324  [pdf, other]

    cs.AI

    Model evaluation for extreme risks

    Authors: Toby Shevlane, Sebastian Farquhar, Ben Garfinkel, Mary Phuong, Jess Whittlestone, Jade Leung, Daniel Kokotajlo, Nahema Marchal, Markus Anderljung, Noam Kolt, Lewis Ho, Divya Siddarth, Shahar Avin, Will Hawkins, Been Kim, Iason Gabriel, Vijay Bolina, Jack Clark, Yoshua Bengio, Paul Christiano, Allan Dafoe

    Abstract: Current approaches to building general-purpose AI systems tend to produce systems with both beneficial and harmful capabilities. Further progress in AI development could lead to capabilities that pose extreme risks, such as offensive cyber capabilities or strong manipulation skills. We explain why model evaluation is critical for addressing extreme risks. Developers must be able to identify danger…

    Submitted 22 September, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Fixed typos; added citation

    ACM Class: K.4.1

  26. arXiv:2305.10589  [pdf, other]

    cs.CV

    INCLG: Inpainting for Non-Cleft Lip Generation with a Multi-Task Image Processing Network

    Authors: Shuang Chen, Amir Atapour-Abarghouei, Edmond S. L. Ho, Hubert P. H. Shum

    Abstract: We present a software that predicts non-cleft facial images for patients with cleft lip, thereby facilitating the understanding, awareness and discussion of cleft lip surgeries. To protect patients privacy, we design a software framework using image inpainting, which does not require cleft lip images for training, thereby mitigating the risk of model leakage. We implement a novel multi-task archit…

    Submitted 17 May, 2023; originally announced May 2023.

  27. arXiv:2304.00858  [pdf, other]

    cs.CV

    Focalized Contrastive View-invariant Learning for Self-supervised Skeleton-based Action Recognition

    Authors: Qianhui Men, Edmond S. L. Ho, Hubert P. H. Shum, Howard Leung

    Abstract: Learning view-invariant representation is a key to improving feature discrimination power for skeleton-based action recognition. Existing approaches cannot effectively remove the impact of viewpoint due to the implicit view-dependent representations. In this work, we propose a self-supervised framework called Focalized Contrastive View-invariant Learning (FoCoViL), which significantly suppresses t…

    Submitted 3 April, 2023; originally announced April 2023.

  28. arXiv:2212.13900  [pdf, other]

    cs.IR cs.AI cs.LG

    POIBERT: A Transformer-based Model for the Tour Recommendation Problem

    Authors: Ngai Lam Ho, Kwan Hui Lim

    Abstract: Tour itinerary planning and recommendation are challenging problems for tourists visiting unfamiliar cities. Many tour recommendation algorithms only consider factors such as the location and popularity of Points of Interest (POIs) but their solutions may not align well with the user's own preferences and other location constraints. Additionally, these solutions do not take into consideration of t…

    Submitted 16 December, 2022; originally announced December 2022.

    Comments: Accepted to the 2022 IEEE International Conference on Big Data (BigData2022)

  29. arXiv:2211.08277  [pdf, other]

    cs.LG physics.soc-ph q-bio.PE

    SPADE4: Sparsity and Delay Embedding based Forecasting of Epidemics

    Authors: Esha Saha, Lam Si Tung Ho, Giang Tran

    Abstract: Predicting the evolution of diseases is challenging, especially when the data availability is scarce and incomplete. The most popular tools for modelling and predicting infectious disease epidemics are compartmental models. They stratify the population into compartments according to health status and model the dynamics of these compartments using dynamical systems. However, these predefined system…

    Submitted 13 June, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: 24 pages, 13 figures, 2 tables

    Journal ref: Bull.Math.Bio.85.8 (2023) 71

  30. arXiv:2209.05709  [pdf, ps, other]

    cs.LG cs.AI

    Generalization Bounds for Deep Transfer Learning Using Majority Predictor Accuracy

    Authors: Cuong N. Nguyen, Lam Si Tung Ho, Vu Dinh, Tal Hassner, Cuong V. Nguyen

    Abstract: We analyze new generalization bounds for deep learning models trained by transfer learning from a source to a target task. Our bounds utilize a quantity called the majority predictor accuracy, which can be computed efficiently from data. We show that our theory is useful in practice since it implies that the majority predictor accuracy can be used as a transferability measure, a fact that is also…

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: 5 pages, Paper published at the International Symposium on Information Theory and Its Applications (ISITA 2022)

  31. arXiv:2209.02824  [pdf, other]

    cs.CV cs.LG eess.IV

    CP-AGCN: Pytorch-based Attention Informed Graph Convolutional Network for Identifying Infants at Risk of Cerebral Palsy

    Authors: Haozheng Zhang, Edmond S. L. Ho, Hubert P. H. Shum

    Abstract: Early prediction is clinically considered one of the essential parts of cerebral palsy (CP) treatment. We propose to implement a low-cost and interpretable classification system for supporting CP prediction based on General Movement Assessment (GMA). We design a Pytorch-based attention-informed graph convolutional network to early identify infants at risk of CP from skeletal data extracted from RG…

    Submitted 6 September, 2022; originally announced September 2022.

  32. arXiv:2208.08848  [pdf, other]

    cs.CV

    A Two-stream Convolutional Network for Musculoskeletal and Neurological Disorders Prediction

    Authors: Manli Zhu, Qianhui Men, Edmond S. L. Ho, Howard Leung, Hubert P. H. Shum

    Abstract: Musculoskeletal and neurological disorders are the most common causes of walking problems among older people, and they often lead to diminished quality of life. Analyzing walking motion data manually requires trained professionals and the evaluations may not always be objective. To facilitate early diagnosis, recent deep learning-based methods have shown promising results for automated analysis, w…

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: Journal of Medical Systems

  33. arXiv:2208.01149  [pdf, other]

    cs.CV

    A Feasibility Study on Image Inpainting for Non-cleft Lip Generation from Patients with Cleft Lip

    Authors: Shuang Chen, Amir Atapour-Abarghouei, Jane Kerby, Edmond S. L. Ho, David C. G. Sainsbury, Sophie Butterworth, Hubert P. H. Shum

    Abstract: A Cleft lip is a congenital abnormality requiring surgical repair by a specialist. The surgeon must have extensive experience and theoretical knowledge to perform surgery, and Artificial Intelligence (AI) method has been proposed to guide surgeons in improving surgical outcomes. If AI can be used to predict what a repaired cleft lip would look like, surgeons could use it as an adjunct to adjust th…

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: 4 pages, 2 figures, BHI 2022

  34. arXiv:2208.00774  [pdf, other]

    cs.GR cs.CV

    Interaction Mix and Match: Synthesizing Close Interaction using Conditional Hierarchical GAN with Multi-Hot Class Embedding

    Authors: Aman Goel, Qianhui Men, Edmond S. L. Ho

    Abstract: Synthesizing multi-character interactions is a challenging task due to the complex and varied interactions between the characters. In particular, precise spatiotemporal alignment between characters is required in generating close interactions such as dancing and fighting. Existing work in generating multi-character interactions focuses on generating a single type of reactive motion for a given seq…

    Submitted 4 August, 2022; v1 submitted 23 July, 2022; originally announced August 2022.

    Comments: Accepted to SCA 2022 (will be published in CGF)

  35. Deep Learning for Classification of Thyroid Nodules on Ultrasound: Validation on an Independent Dataset

    Authors: Jingxi Weng, Benjamin Wildman-Tobriner, Mateusz Buda, Jichen Yang, Lisa M. Ho, Brian C. Allen, Wendy L. Ehieli, Chad M. Miller, Jikai Zhang, Maciej A. Mazurowski

    Abstract: Objectives: The purpose is to apply a previously validated deep learning algorithm to a new thyroid nodule ultrasound image dataset and compare its performances with radiologists. Methods: Prior study presented an algorithm which is able to detect thyroid nodules and then make malignancy classifications with two ultrasound images. A multi-task deep convolutional neural network was trained from 127…

    Submitted 4 May, 2023; v1 submitted 27 July, 2022; originally announced July 2022.

    Comments: Clinical Imaging (2023)

  36. arXiv:2207.06828  [pdf, other]

    cs.CV cs.LG

    Pose-based Tremor Classification for Parkinson's Disease Diagnosis from Video

    Authors: Haozheng Zhang, Edmond S. L. Ho, Xiatian Zhang, Hubert P. H. Shum

    Abstract: Parkinson's disease (PD) is a progressive neurodegenerative disorder that results in a variety of motor dysfunction symptoms, including tremors, bradykinesia, rigidity and postural instability. The diagnosis of PD mainly relies on clinical experience rather than a definite medical test, and the diagnostic accuracy is only about 73-84% since it is challenged by the subjective opinions or experience…

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: MICCAI 2022

  37. Interaction-aware Decision-making for Automated Vehicles using Social Value Orientation

    Authors: Luca Crosato, Hubert P. H. Shum, Edmond S. L. Ho, Chongfeng Wei

    Abstract: Motion control algorithms in the presence of pedestrians are critical for the development of safe and reliable Autonomous Vehicles (AVs). Traditional motion control algorithms rely on manually designed decision-making policies which neglect the mutual interactions between AVs and pedestrians. On the other hand, recent advances in Deep Reinforcement Learning allow for the automatic learning of poli…

    Submitted 12 July, 2022; originally announced July 2022.

  38. arXiv:2207.05733  [pdf, other]

    cs.CV cs.AI

    A Skeleton-aware Graph Convolutional Network for Human-Object Interaction Detection

    Authors: Manli Zhu, Edmond S. L. Ho, Hubert P. H. Shum

    Abstract: Detecting human-object interactions is essential for comprehensive understanding of visual scenes. In particular, spatial connections between humans and objects are important cues for reasoning interactions. To this end, we propose a skeleton-aware graph convolutional network for human-object interaction detection, named SGCN4HOI. Our network exploits the spatial connections between human keypoint…

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: Accepted by IEEE SMC 2022

  39. arXiv:2206.14604  [pdf, other]

    cs.DB

    Mining Seasonal Temporal Patterns in Time Series

    Authors: Van Long Ho, Nguyen Ho, Torben Bach Pedersen

    Abstract: Very large time series are increasingly available from an ever wider range of IoT-enabled sensors, from which significant insights can be obtained through mining temporal patterns from them. A useful type of patterns found in many real-world applications exhibits periodic occurrences, and is thus called seasonal temporal pattern (STP). Compared to regular patterns, mining seasonal temporal pattern…

    Submitted 9 January, 2023; v1 submitted 28 June, 2022; originally announced June 2022.

  40. arXiv:2204.13584  [pdf, ps, other]

    eess.SP cs.AI cs.CV cs.LG

    Predicting Sleeping Quality using Convolutional Neural Networks

    Authors: Vidya Rohini Konanur Sathish, Wai Lok Woo, Edmond S. L. Ho

    Abstract: Identifying sleep stages and patterns is an essential part of diagnosing and treating sleep disorders. With the advancement of smart technologies, sensor data related to sleeping patterns can be captured easily. In this paper, we propose a Convolution Neural Network (CNN) architecture that improves the classification performance. In particular, we benchmark the classification performance from diff…

    Submitted 24 April, 2022; originally announced April 2022.

    ACM Class: I.2.10

  41. arXiv:2204.11357  [pdf, ps, other]

    cs.LG cs.CR cs.NI

    Improving Deep Learning Model Robustness Against Adversarial Attack by Increasing the Network Capacity

    Authors: Marco Marchetti, Edmond S. L. Ho

    Abstract: Nowadays, we are more and more reliant on Deep Learning (DL) models and thus it is essential to safeguard the security of these systems. This paper explores the security issues in Deep Learning and analyses, through the use of experiments, the way forward to build more resilient models. Experiments are conducted to identify the strengths and weaknesses of a new approach to improve the robustness o…

    Submitted 24 April, 2022; originally announced April 2022.

    ACM Class: I.2.10

  42. arXiv:2204.10997  [pdf, other]

    cs.CV cs.LG

    Cerebral Palsy Prediction with Frequency Attention Informed Graph Convolutional Networks

    Authors: Haozheng Zhang, Hubert P. H. Shum, Edmond S. L. Ho

    Abstract: Early diagnosis and intervention are clinically considered the paramount part of treating cerebral palsy (CP), so it is essential to design an efficient and interpretable automatic prediction system for CP. We highlight a significant difference between CP infants' frequency of human movement and that of the healthy group, which improves prediction performance. However, the existing deep learning-b…

    Submitted 28 March, 2023; v1 submitted 23 April, 2022; originally announced April 2022.

  43. arXiv:2204.09131  [pdf, other]

    cs.DB

    A Unified Approach for Multi-Scale Synchronous Correlation Search in Big Time Series -- Full Version

    Authors: Nguyen Ho, Van Long Ho, Torben Bach Pedersen, Mai Vu, Christophe A. N. Biscio

    Abstract: The wide deployment of IoT sensors has enabled the collection of very big time series across different domains, from which advanced analytics can be performed to find unknown relationships, most importantly the correlations between them. However, current approaches for correlation search on time series are limited to only a single temporal scale and simple types of relations, and cannot handle noi…

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: 18 pages

  44. arXiv:2203.08220  [pdf, other]

    cs.CR cs.AR

    Power-Based Side-Channel Attack for AES Key Extraction on the ATMega328 Microcontroller

    Authors: Utsav Banerjee, Lisa Ho, Skanda Koppula

    Abstract: We demonstrate the extraction of an AES secret key from flash memory on the ATMega328 microcontroller (the microcontroller used on the popular Arduino Genuino/Uno board). We loaded a standard AVR-architecture AES-128 implementation onto the chip and encrypted randomly chosen plaintexts with several different keys. We measured the chip's power consumption during encryption, correlated observed powe…

    Submitted 13 March, 2022; originally announced March 2022.

    Comments: MIT 6.858 Class Project

  45. arXiv:2111.10243  [pdf, other]

    math.ST cs.LG

    Posterior concentration and fast convergence rates for generalized Bayesian learning

    Authors: Lam Si Tung Ho, Binh T. Nguyen, Vu Dinh, Duy Nguyen

    Abstract: In this paper, we study the learning rate of generalized Bayes estimators in a general setting where the hypothesis class can be uncountable and have an irregular shape, the loss function can have heavy tails, and the optimal hypothesis may not be unique. We prove that under the multi-scale Bernstein's condition, the generalized posterior distribution concentrates around the set of optimal hypothe…

    Submitted 19 November, 2021; originally announced November 2021.

  46. arXiv:2110.00380  [pdf, other]

    cs.GR cs.CV

    GAN-based Reactive Motion Synthesis with Class-aware Discriminators for Human-human Interaction

    Authors: Qianhui Men, Hubert P. H. Shum, Edmond S. L. Ho, Howard Leung

    Abstract: Creating realistic characters that can react to the users' or another character's movement can benefit computer graphics, games and virtual reality hugely. However, synthesizing such reactive motions in human-human interactions is a challenging task due to the many different ways two humans can interact. While there are a number of successful researches in adapting the generative adversarial netwo…

    Submitted 1 October, 2021; originally announced October 2021.

  47. arXiv:2109.13061  [pdf, other]

    cs.LG stat.ML

    Searching for Minimal Optimal Neural Networks

    Authors: Lam Si Tung Ho, Vu Dinh

    Abstract: Large neural network models have high predictive power but may suffer from overfitting if the training set is not large enough. Therefore, it is desirable to select an appropriate size for neural networks. The destructive approach, which starts with a large architecture and then reduces the size using a Lasso-type penalty, has been used extensively for this task. Despite its popularity, there is n…

    Submitted 27 September, 2021; originally announced September 2021.

  48. arXiv:2109.02288  [pdf, other]

    cs.CV

    Toward Realistic Single-View 3D Object Reconstruction with Unsupervised Learning from Multiple Images

    Authors: Long-Nhat Ho, Anh Tuan Tran, Quynh Phung, Minh Hoai

    Abstract: Recovering the 3D structure of an object from a single image is a challenging task due to its ill-posed nature. One approach is to utilize the plentiful photos of the same object category to learn a strong 3D shape prior for the object. This approach has successfully been demonstrated by a recent work of Wu et al. (2020), which obtained impressive 3D reconstruction networks with unsupervised learn…

    Submitted 7 September, 2021; v1 submitted 6 September, 2021; originally announced September 2021.

    Comments: Accepted to the main ICCV 2021 conference

  49. arXiv:2108.10825  [pdf, other]

    cs.LG math.NA

    Adaptive Group Lasso Neural Network Models for Functions of Few Variables and Time-Dependent Data

    Authors: Lam Si Tung Ho, Nicholas Richardson, Giang Tran

    Abstract: In this paper, we propose an adaptive group Lasso deep neural network for high-dimensional function approximation where input data are generated from a dynamical system and the target function depends on few active variables or few linear combinations of variables. We approximate the target function by a deep neural network and enforce an adaptive group Lasso constraint to the weights of a suitabl…

    Submitted 3 December, 2021; v1 submitted 24 August, 2021; originally announced August 2021.

  50. arXiv:2108.03629  [pdf]

    cs.CR

    An Anonymous On-Street Parking Authentication Scheme via Zero-Knowledge Set Membership Proof

    Authors: Jerry Chien Lin Ho, Chi-Yi Lin

    Abstract: The amount of information generated grows as more and more sensor and IoT devices are deployed in smart cities. It is of utmost importance for us to consider the privacy data leakage and compromised identity from both outside adversaries and inside abuse of data access privilege. The security assumption of the system should not solely rely on the fact that permission and access control were being…

    Submitted 8 August, 2021; originally announced August 2021.