Search | arXiv e-print repository

doi 10.55417/fr.2024001

Fast and Modular Autonomy Software for Autonomous Racing Vehicles

Authors: Andrew Saba, Aderotimi Adetunji, Adam Johnson, Aadi Kothari, Matthew Sivaprakasam, Joshua Spisak, Prem Bharatia, Arjun Chauhan, Brendan Duff Jr., Noah Gasparro, Charles King, Ryan Larkin, Brian Mao, Micah Nye, Anjali Parashar, Joseph Attias, Aurimas Balciunas, Austin Brown, Chris Chang, Ming Gao, Cindy Heredia, Andrew Keats, Jose Lavariega, William Muckelroy III, Andre Slavescu, et al. (5 additional authors not shown)

Abstract: Autonomous motorsports aim to replicate the human racecar driver with software and sensors. As in traditional motorsports, Autonomous Racing Vehicles (ARVs) are pushed to their handling limits in multi-agent scenarios at extremely high ($\geq 150$ mph) speeds. This Operational Design Domain (ODD) presents unique challenges across the autonomy stack. The Indy Autonomous Challenge (IAC) is an international competition aiming to advance autonomous vehicle development through ARV competitions. While far from challenging what a human racecar driver can do, the IAC is pushing the state of the art by facilitating full-sized ARV competitions. This paper details the MIT-Pitt-RW Team's approach to autonomous racing in the IAC. In this work, we present our modular and fast approach to agent detection, motion planning and controls to create an autonomy stack. We also provide analysis of the performance of the software stack in single and multi-agent scenarios for rapid deployment in a fast-paced competition environment. We also cover what did and did not work when deployed on a physical system, the Dallara AV-21 platform, and potential improvements to address these shortcomings. Finally, we convey lessons learned and discuss limitations and future directions for improvement.

Submitted 27 August, 2024; originally announced August 2024.

Comments: Published in Journal of Field Robotics

Journal ref: Field Robotics Volume 4 (2024) 1-45

arXiv:2407.20814

Embracing Fairness in Consumer Electricity Markets using an Automatic Market Maker

Authors: Shaun Sweeney, Chris King, Mark O'Malley, Robert Shorten

Abstract: As consumer flexibility becomes expected, it is important that the market mechanisms which attain that flexibility are perceived as fair. We set out fairness issues in energy markets today, and propose a market design to address them. Consumption is categorised as either essential or flexible with different prices and reliability levels for each. Prices are generated by an Automatic Market Maker (AMM) based on instantaneous scarcity and resource is allocated using a novel Fair Play algorithm. We empirically show the performance of the system over 1 year for 101 UK households and benchmark its performance against more classical approaches.

Submitted 30 July, 2024; originally announced July 2024.

Comments: Under review for inclusion in Special Issue of Applied Energy on "(R)Evolution of Electricity Markets: Designing Smart Electricity Markets for a Decarbonized World"

arXiv:2406.04557

Countrywide natural experiment reveals impact of built environment on physical activity

Authors: Tim Althoff, Boris Ivanovic, Jennifer L. Hicks, Scott L. Delp, Abby C. King, Jure Leskovec

Abstract: While physical activity is critical to human health, most people do not meet recommended guidelines. More walkable built environments have the potential to increase activity across the population. However, previous studies on the built environment and physical activity have led to mixed findings, possibly due to methodological limitations such as small cohorts, few or single locations, over-reliance on self-reported measures, and cross-sectional designs. Here, we address these limitations by leveraging a large U.S. cohort of smartphone users (N=2,112,288) to evaluate within-person longitudinal behavior changes that occurred over 248,266 days of objectively-measured physical activity across 7,447 relocations among 1,609 U.S. cities. By analyzing the results of this natural experiment, which exposed individuals to differing built environments, we find that increases in walkability are associated with significant increases in physical activity after relocation (and vice versa). These changes hold across subpopulations of different genders, age, and body-mass index (BMI), and are sustained over three months after moving. The added activity observed after moving to a more walkable location is predominantly composed of moderate-to-vigorous physical activity (MVPA), which is linked to an array of associated health benefits across the life course. A simulation experiment demonstrates that substantial walkability improvements (i.e., bringing all US locations to the walkability level of Chicago or Philadelphia) may lead to 10.3% or 33 million more Americans meeting aerobic physical activity guidelines. Evidence against residential self-selection confounding is reported. Our findings provide robust evidence supporting the importance of the built environment in directly improving health-enhancing physical activity, in addition to offering potential guidance for public policy activities in this area.

Submitted 6 June, 2024; originally announced June 2024.

arXiv:2405.15965

What is a Goldilocks Face Verification Test Set?

Authors: Haiyu Wu, Sicong Tian, Aman Bhatta, Jacob Gutierrez, Grace Bezold, Genesis Argueta, Karl Ricanek Jr., Michael C. King, Kevin W. Bowyer

Abstract: Face Recognition models are commonly trained with web-scraped datasets containing millions of images and evaluated on test sets emphasizing pose, age and mixed attributes. With train and test sets both assembled from web-scraped images, it is critical to ensure disjoint sets of identities between train and test sets. However, existing train and test sets have not considered this. Moreover, as accuracy levels become saturated, such as LFW $>99.8\%$, more challenging test sets are needed. We show that current train and test sets are generally not identity- or even image-disjoint, and that this results in an optimistic bias in the estimated accuracy. In addition, we show that identity-disjoint folds are important in the 10-fold cross-validation estimate of test accuracy. To better support continued advances in face recognition, we introduce two "Goldilocks" test sets, Hadrian and Eclipse. The former emphasizes challenging facial hairstyles and the latter emphasizes challenging over- and under-exposure conditions. Images in both datasets are from a large, controlled-acquisition (not web-scraped) dataset, so they are identity- and image-disjoint with all popular training sets. Accuracy for these new test sets generally falls below that observed on LFW, CPLFW, CALFW, CFP-FP and AgeDB-30, showing that these datasets represent important dimensions for improvement of face recognition. The datasets are available at: https://github.com/HaiyuWu/SOTA-Face-Recognition-Train-and-Test

Submitted 24 May, 2024; originally announced May 2024.

arXiv:2403.00770

doi 10.1016/j.chaos.2023.114305

Blockchain Metrics and Indicators in Cryptocurrency Trading

Authors: Juan C. King, Roberto Dale, José M. Amigó

Abstract: The objective of this paper is the construction of new indicators that can be useful to operate in the cryptocurrency market. These indicators are based on public data obtained from the blockchain network, specifically from the nodes that make up Bitcoin mining. Therefore, our analysis is unique to that network. The results obtained with numerical simulations of algorithmic trading and prediction via statistical models and Machine Learning demonstrate the importance of variables such as the hash rate, the difficulty of mining or the cost per transaction when it comes to trading Bitcoin assets or predicting the direction of price. Variables obtained from the blockchain network will be called here blockchain metrics. The corresponding indicators (inspired by the "Hash Ribbon") perform well in locating buy signals. From our results, we conclude that such blockchain indicators allow obtaining information with a statistical advantage in the highly volatile cryptocurrency market.

Submitted 11 February, 2024; originally announced March 2024.

Comments: 26 pages; 14 figures

Journal ref: Solitons & Fractals, 178, 114305 (2024)

arXiv:2309.05180

What's color got to do with it? Face recognition in grayscale

Authors: Aman Bhatta, Domingo Mery, Haiyu Wu, Joyce Annan, Micheal C. King, Kevin W. Bowyer

Abstract: State-of-the-art deep CNN face matchers are typically created using extensive training sets of color face images. Our study reveals that such matchers attain virtually identical accuracy when trained on either grayscale or color versions of the training set, even when the evaluation is done using color test images. Furthermore, we demonstrate that shallower models, lacking the capacity to model complex representations, rely more heavily on low-level features such as those associated with color. As a result, they display diminished accuracy when trained with grayscale images. We then consider possible causes for deeper CNN face matchers "not seeing color". Popular web-scraped face datasets actually have 30 to 60% of their identities with one or more grayscale images. We analyze whether this grayscale element in the training set impacts the accuracy achieved, and conclude that it does not. We demonstrate that using only grayscale images for both training and testing achieves accuracy comparable to that achieved using only color images for deeper models. This holds true for both real and synthetic training datasets. HSV color space, which separates chroma and luma information, does not improve the network's learning about color any more than in the RGB color space. We then show that the skin region of an individual's images in a web-scraped training set exhibits significant variation in their mapping to color space. This suggests that color carries limited identity-specific information. We also show that when the first convolution layer is restricted to a single filter, models learn a grayscale conversion filter and pass a grayscale version of the input color image to the next layer. Finally, we demonstrate that leveraging the lower per-image storage for grayscale to increase the number of images in the training set can improve accuracy of the face recognition model.

Submitted 2 July, 2024; v1 submitted 10 September, 2023; originally announced September 2023.

Comments: This is a replacement version of the previous arXiv submission: 2309.05180 (Our Deep CNN Face Matchers Have Developed Achromatopsia). The past version is published in CVPRW and available in IEEE proceedings. This submitted version is an extension of the conference paper

arXiv:2309.04447

Impact of Blur and Resolution on Demographic Disparities in 1-to-Many Facial Identification

Authors: Aman Bhatta, Gabriella Pangelinan, Michael C. King, Kevin W. Bowyer

Abstract: Most studies to date that have examined demographic variations in face recognition accuracy have analyzed 1-to-1 matching accuracy, using images that could be described as "government ID quality". This paper analyzes the accuracy of 1-to-many facial identification across demographic groups, and in the presence of blur and reduced resolution in the probe image as might occur in "surveillance camera quality" images. Cumulative match characteristic curves (CMC) are not appropriate for comparing propensity for rank-one recognition errors across demographics, and so we use three metrics for our analysis: (1) the well-known d' metric between mated and non-mated score distributions, and, introduced in this work, (2) the absolute score difference between thresholds in the high-similarity tail of the non-mated and the low-similarity tail of the mated distribution, and (3) the distribution of (mated - non-mated rank-one scores) across the set of probe images. We find that demographic variation in 1-to-many accuracy does not entirely follow what has been observed in 1-to-1 matching accuracy. Also, different from 1-to-1 accuracy, demographic comparison of 1-to-many accuracy can be affected by different numbers of identities and images across demographics. More importantly, we show that increased blur in the probe image, or reduced resolution of the face in the probe image, can significantly increase the false positive identification rate. And we show that the demographic variation in these high blur or low resolution conditions is much larger for male / female than for African-American / Caucasian. The point that 1-to-many accuracy can potentially collapse in the context of processing "surveillance camera quality" probe images against a "government ID quality" gallery is an important one.

Submitted 23 January, 2024; v1 submitted 8 September, 2023; originally announced September 2023.

Comments: 9 pages, 8 figures, Conference submission

arXiv:2308.02273

Is Scenario Generation Ready for SOTIF? A Systematic Literature Review

Authors: Lukas Birkemeyer, Christian King, Ina Schaefer

Abstract: Scenario-based testing is considered state-of-the-art to verify and validate Advanced Driver Assistance Systems or Automated Driving Systems. Due to the official launch of the SOTIF-standard (ISO 21448), scenario-based testing becomes more and more relevant for releasing those Highly Automated Driving Systems. However, an essential missing detail prevents the practical application of the SOTIF-standard: How to practically generate scenarios for scenario-based testing? In this paper, we perform a Systematic Literature Review to identify techniques that generate scenarios complying with requirements of the SOTIF-standard. We classify existing scenario generation techniques and evaluate the characteristics of generated scenarios with respect to SOTIF requirements. We investigate which details of the real world are covered by generated scenarios, whether scenarios are specific for a system under test or generic, and whether scenarios are designed to minimize the set of unknown and hazardous scenarios. We conclude that scenarios generated with existing techniques do not comply with requirements implied by the SOTIF-standard; hence, we propose directions for future research.

Submitted 8 August, 2023; v1 submitted 4 August, 2023; originally announced August 2023.

Comments: accepted at IEEE ITSC 2023

arXiv:2305.06307

Analysis of Adversarial Image Manipulations

Authors: Ahsi Lo, Gabriella Pangelinan, Michael C. King

Abstract: As virtual and physical identity grow increasingly intertwined, the importance of privacy and security in the online sphere becomes paramount. In recent years, multiple news stories have emerged of private companies scraping web content and doing research with or selling the data. Images uploaded online can be scraped without users' consent or knowledge. Users of social media platforms whose images are scraped may be at risk of being identified in other uploaded images or in real-world identification situations. This paper investigates how simple, accessible image manipulation techniques affect the accuracy of facial recognition software in identifying an individual's various face images based on one unique image.

Submitted 10 May, 2023; originally announced May 2023.

arXiv:2304.07175

Exploring Causes of Demographic Variations In Face Recognition Accuracy

Authors: Gabriella Pangelinan, K. S. Krishnapriya, Vitor Albiero, Grace Bezold, Kai Zhang, Kushal Vangara, Michael C. King, Kevin W. Bowyer

Abstract: In recent years, media reports have called out bias and racism in face recognition technology. We review experimental results exploring several speculated causes for asymmetric cross-demographic performance. We consider accuracy differences as represented by variations in non-mated (impostor) and / or mated (genuine) distributions for 1-to-1 face matching. Possible causes explored include differences in skin tone, face size and shape, imbalance in number of identities and images in the training data, and amount of face visible in the test data ("face pixels"). We find that demographic differences in face pixel information of the test images appear to most directly impact the resultant differences in face recognition accuracy.

Submitted 14 April, 2023; originally announced April 2023.

arXiv:2303.02570

Time Associated Meta Learning for Clinical Prediction

Authors: Hao Liu, Muhan Zhang, Zehao Dong, Lecheng Kong, Yixin Chen, Bradley Fritz, Dacheng Tao, Christopher King

Abstract: Rich Electronic Health Records (EHR) have created opportunities to improve clinical processes using machine learning methods. Prediction of the same patient events at different time horizons can have very different applications and interpretations; however, the limited number of events in each potential time window hurts the effectiveness of conventional machine learning algorithms. We propose a novel time associated meta learning (TAML) method to make effective predictions at multiple future time points. We view time-associated disease prediction as classification tasks at multiple time points. Such closely-related classification tasks are an excellent candidate for model-based meta learning. To address the sparsity problem after task splitting, TAML employs a temporal information sharing strategy to augment the number of positive samples and include the prediction of related phenotypes or events in the meta-training phase. We demonstrate the effectiveness of TAML on multiple clinical datasets, where it consistently outperforms a range of strong baselines. We also develop a MetaEHR package for implementing both time-associated and time-independent few-shot prediction on EHR data.

Submitted 4 March, 2023; originally announced March 2023.

arXiv:2301.11608

A Multi-View Joint Learning Framework for Embedding Clinical Codes and Text Using Graph Neural Networks

Authors: Lecheng Kong, Christopher King, Bradley Fritz, Yixin Chen

Abstract: Learning to represent free text is a core task in many clinical machine learning (ML) applications, as clinical text contains observations and plans not otherwise available for inference. State-of-the-art methods use large language models developed with immense computational resources and training data; however, applying these models is challenging because of the highly varying syntax and vocabulary in clinical free text. Structured information such as International Classification of Disease (ICD) codes often succinctly abstracts the most important facts of a clinical encounter and yields good performance, but is often not as available as clinical text in real-world scenarios. We propose a multi-view learning framework that jointly learns from codes and text to combine the availability and forward-looking nature of text and better performance of ICD codes. The learned text embeddings can be used as inputs to predictive algorithms independent of the ICD codes during inference. Our approach uses a Graph Neural Network (GNN) to process ICD codes, and Bi-LSTM to process text. We apply Deep Canonical Correlation Analysis (DCCA) to enforce the two views to learn a similar representation of each patient. In experiments using planned surgical procedure text, our model outperforms BERT models fine-tuned to clinical data, and in experiments using diverse text in MIMIC-III, our model is competitive with a fine-tuned BERT at a tiny fraction of its computational effort.

Submitted 27 January, 2023; originally announced January 2023.

arXiv:2211.04442

Algorithmic Bias in Machine Learning Based Delirium Prediction

Authors: Sandhya Tripathi, Bradley A Fritz, Michael S Avidan, Yixin Chen, Christopher R King

Abstract: Although prediction models for delirium, a commonly occurring condition during general hospitalization or post-surgery, have not gained huge popularity, their algorithmic bias evaluation is crucial due to the existing association between social determinants of health and delirium risk. In this context, using MIMIC-III and another academic hospital dataset, we present some initial experimental evidence showing how sociodemographic features such as sex and race can impact the model performance across subgroups. With this work, our intent is to initiate a discussion about the intersectionality effects of old age, race and socioeconomic factors on the early-stage detection and prevention of delirium using ML.

Submitted 26 November, 2022; v1 submitted 8 November, 2022; originally announced November 2022.

Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2022, November 28th, 2022, New Orleans, United States & Virtual, http://www.ml4h.cc, 14 pages

arXiv:2210.07356

Consistency and Accuracy of CelebA Attribute Values

Authors: Haiyu Wu, Grace Bezold, Manuel Günther, Terrance Boult, Michael C. King, Kevin W. Bowyer

Abstract: We report the first systematic analysis of the experimental foundations of facial attribute classification. Two annotators independently assigning attribute values shows that only 12 of 40 common attributes are assigned values with >= 95% consistency, and three (high cheekbones, pointed nose, oval face) have essentially random consistency. Of 5,068 duplicate face appearances in CelebA, attributes have contradicting values on anywhere from 10 to 860 of the 5,068 duplicates. Manual audit of a subset of CelebA estimates error rates as high as 40% for (no beard=false), even though the labeling consistency experiment indicates that no beard could be assigned with >= 95% consistency. Selecting the mouth slightly open (MSO) for deeper analysis, we estimate the error rate for (MSO=true) at about 20% and (MSO=false) at about 2%. A corrected version of the MSO attribute values enables learning a model that achieves higher accuracy than previously reported for MSO. Corrected values for CelebA MSO are available at https://github.com/HaiyuWu/CelebAMSO.

Submitted 16 April, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

arXiv:2210.04417

Self-explaining Hierarchical Model for Intraoperative Time Series

Authors: Dingwen Li, Bing Xue, Christopher King, Bradley Fritz, Michael Avidan, Joanna Abraham, Chenyang Lu

Abstract: Major postoperative complications are devastating to surgical patients. Some of these complications are potentially preventable via early predictions based on intraoperative data. However, intraoperative data comprise long and fine-grained multivariate time series, prohibiting the effective learning of accurate models. The large gaps associated with clinical events and protocols are usually ignored. Moreover, deep models generally lack transparency. Nevertheless, the interpretability is crucial to assist clinicians in planning for and delivering postoperative care and timely interventions. Towards this end, we propose a hierarchical model combining the strength of both attention and recurrent models for intraoperative time series. We further develop an explanation module for the hierarchical model to interpret the predictions by providing contributions of intraoperative data in a fine-grained manner. Experiments on a large dataset of 111,888 surgeries with multiple outcomes and an external high-resolution ICU dataset show that our model can achieve strong predictive performance (i.e., high accuracy) and offer robust interpretations (i.e., high transparency) for predicted outcomes based on intraoperative time series.

Submitted 9 October, 2022; originally announced October 2022.

arXiv:2207.03536

Deep Learning to Jointly Schema Match, Impute, and Transform Databases

Authors: Sandhya Tripathi, Bradley A. Fritz, Mohamed Abdelhack, Michael S. Avidan, Yixin Chen, Christopher R. King

Abstract: An applied problem facing all areas of data science is harmonizing data sources. Joining data from multiple origins with unmapped and only partially overlapping features is a prerequisite to developing and testing robust, generalizable algorithms, especially in health care. We approach this issue in the common but difficult case of numeric features such as nearly Gaussian and binary features, where unit changes and variable shift make simple matching of univariate summaries unsuccessful. We develop two novel procedures to address this problem. First, we demonstrate multiple methods of "fingerprinting" a feature based on its associations to other features. In the setting of even modest prior information, this allows most shared features to be accurately identified. Second, we demonstrate a deep learning algorithm for translation between databases. Unlike prior approaches, our algorithm takes advantage of discovered mappings while identifying surrogates for unshared features and learning transformations. In synthetic and real-world experiments using two electronic health record databases, our algorithms outperform existing baselines for matching variable sets, while jointly learning to impute unshared or transformed variables.

Submitted 22 June, 2022; originally announced July 2022.

arXiv:2206.04867 [pdf, other]

The Gender Gap in Face Recognition Accuracy Is a Hairy Problem

Authors: Aman Bhatta, Vítor Albiero, Kevin W. Bowyer, Michael C. King

Abstract: It is broadly accepted that there is a "gender gap" in face recognition accuracy, with females having higher false match and false non-match rates. However, relatively little is known about the cause(s) of this gender gap. Even the recent NIST report on demographic effects lists "analyze cause and effect" under "what we did not do". We first demonstrate that female and male hairstyles have important differences that impact face recognition accuracy. In particular, compared to females, male facial hair contributes to creating a greater average difference in appearance between different male faces. We then demonstrate that when the data used to estimate recognition accuracy is balanced across gender for how hairstyles occlude the face, the initially observed gender gap in accuracy largely disappears. We show this result for two different matchers, and analyzing images of Caucasians and of African-Americans. These results suggest that future research on demographic variation in accuracy should include a check for balanced quality of the test data as part of the problem formulation. To promote reproducible research, matchers, attribute classifiers, and datasets used in this research are/will be publicly available.

Submitted 10 June, 2022; originally announced June 2022.

arXiv:2206.01881 [pdf, other]

Face Recognition Accuracy Across Demographics: Shining a Light Into the Problem

Authors: Haiyu Wu, Vítor Albiero, K. S. Krishnapriya, Michael C. King, Kevin W. Bowyer

Abstract: We explore varying face recognition accuracy across demographic groups as a phenomenon partly caused by differences in face illumination. We observe that for a common operational scenario with controlled image acquisition, there is a large difference in face region brightness between African-American and Caucasian, and also a smaller difference between male and female. We show that impostor image pairs with both faces under-exposed, or both overexposed, have an increased false match rate (FMR). Conversely, image pairs with strongly different face brightness have a decreased similarity measure. We propose a brightness information metric to measure variation in brightness in the face and show that face brightness that is too low or too high has reduced information in the face region, providing a cause for the lower accuracy. Based on this, for operational scenarios with controlled image acquisition, illumination should be adjusted for each individual to obtain appropriate face image brightness. This is the first work that we are aware of to explore how the level of brightness of the skin region in a pair of face images (rather than a single image) impacts face recognition accuracy, and to evaluate this as a systematic factor causing unequal accuracy across demographics. The code is at https://github.com/HaiyuWu/FaceBrightness.

Submitted 16 April, 2023; v1 submitted 3 June, 2022; originally announced June 2022.

arXiv:2203.12620 [pdf]

Evaluation of Non-Invasive Thermal Imaging for detection of Viability of Onchocerciasis worms

Authors: Ronak Dedhiya, Siva Teja Kakileti, Goutham Deepu, Kanchana Gopinath, Nicholas Opoku, Christopher King, Geetha Manjunath

Abstract: Onchocerciasis is causing blindness in over half a million people in the world today. Drug development for the disease is crippled as there is no way of measuring effectiveness of the drug without an invasive procedure. Drug efficacy measurement through assessment of viability of onchocerca worms requires the patients to undergo nodulectomy which is invasive, expensive, time-consuming, skill-dependent, infrastructure dependent and lengthy process. In this paper, we discuss the first-ever study that proposes use of machine learning over thermal imaging to non-invasively and accurately predict the viability of worms. The key contributions of the paper are (i) a unique thermal imaging protocol along with pre-processing steps such as alignment, registration and segmentation to extract interpretable features (ii) extraction of relevant semantic features (iii) development of accurate classifiers for detecting the existence of viable worms in a nodule. When tested on a prospective test data of 30 participants with 48 palpable nodules, we achieved an Area Under the Curve (AUC) of 0.85.

Submitted 23 March, 2022; originally announced March 2022.

Comments: It is submitted to EMBC 2022 and is currently under review

arXiv:2202.12210 [pdf, other]

BERTVision -- A Parameter-Efficient Approach for Question Answering

Authors: Siduo Jiang, Cristopher Benge, William Casey King

Abstract: We present a highly parameter-efficient approach for Question Answering that significantly reduces the need for extended BERT fine-tuning. Our method uses information from the hidden state activations of each BERT transformer layer, which is discarded during typical BERT inference. Our best model achieves maximal BERT performance at a fraction of the training time and GPU or TPU expense. Performance is further improved by ensembling our model with BERT's predictions. Furthermore, we find that near optimal performance can be achieved for QA span annotation using less training data. Our experiments show that this approach works well not only for span annotation, but also for classification, suggesting that it may be extensible to a wider range of tasks.

Submitted 24 February, 2022; originally announced February 2022.

Comments: 7 pages, 11 with appendix

MSC Class: 68T07 ACM Class: I.2

arXiv:2202.07869 [pdf, other]

Automating the Learning of Inverse Kinematics for Robotic Arms with Redundant DoFs

Authors: Chi-Kai Ho, Chung-Ta King

Abstract: Inverse Kinematics (IK) solves the problem of mapping from the Cartesian space to the joint configuration space of a robotic arm. It has a wide range of applications in areas such as computer graphics, protein structure prediction, and robotics. With the vast advances of artificial neural networks (NNs), many researchers recently turned to data-driven approaches to solving the IK problem. Unfortunately, NNs become inadequate for robotic arms with redundant Degrees-of-Freedom (DoFs). This is because such arms may have multiple angle solutions to reach the same desired pose, while typical NNs only implement one-to-one mapping functions, which associate just one consistent output for a given input. In order to train usable NNs to solve the IK problem, most existing works employ customized training datasets, in which every desired pose only has one angle solution. This inevitably limits the generalization and automation of the proposed approaches. This paper breaks through at two fronts: (1) a systematic and mechanical approach to training data collection that covers the entire working space of the robotic arm, and can be fully automated and done only once after the arm is developed; and (2) a novel NN-based framework that can leverage the redundant DoFs to produce multiple angle solutions to any given desired pose of the robotic arm. The latter is especially useful for robotic applications such as obstacle avoidance and posture imitation.

Submitted 29 August, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

arXiv:2112.14656 [pdf, other]

doi 10.1109/TIFS.2021.3135750

Gendered Differences in Face Recognition Accuracy Explained by Hairstyles, Makeup, and Facial Morphology

Authors: Vítor Albiero, Kai Zhang, Michael C. King, Kevin W. Bowyer

Abstract: Media reports have accused face recognition of being "biased", "sexist" and "racist". There is consensus in the research literature that face recognition accuracy is lower for females, who often have both a higher false match rate and a higher false non-match rate. However, there is little published research aimed at identifying the cause of lower accuracy for females. For instance, the 2019 Face Recognition Vendor Test that documents lower female accuracy across a broad range of algorithms and datasets also lists "Analyze cause and effect" under the heading "What we did not do". We present the first experimental analysis to identify major causes of lower face recognition accuracy for females on datasets where previous research has observed this result. Controlling for equal amount of visible face in the test images mitigates the apparent higher false non-match rate for females. Additional analysis shows that makeup-balanced datasets further improves females to achieve lower false non-match rates. Finally, a clustering experiment suggests that images of two different females are inherently more similar than of two different males, potentially accounting for a difference in false match rates.

Submitted 29 December, 2021; originally announced December 2021.

Comments: arXiv admin note: substantial text overlap with arXiv:2008.06989

arXiv:2107.08574 [pdf, other]

A Modulation Layer to Increase Neural Network Robustness Against Data Quality Issues

Authors: Mohamed Abdelhack, Jiaming Zhang, Sandhya Tripathi, Bradley A Fritz, Daniel Felsky, Michael S Avidan, Yixin Chen, Christopher R King

Abstract: Data missingness and quality are common problems in machine learning, especially for high-stakes applications such as healthcare. Developers often train machine learning models on carefully curated datasets using only high quality data; however, this reduces the utility of such models in production environments. We propose a novel neural network modification to mitigate the impacts of low quality and missing data which involves replacing the fixed weights of a fully-connected layer with a function of an additional input. This is inspired from neuromodulation in biological neural networks where the cortex can up- and down-regulate inputs based on their reliability and the presence of other data. In testing, with reliability scores as a modulating signal, models with modulating layers were found to be more robust against degradation of data quality, including additional missingness. These models are superior to imputation as they save on training time by completely skipping the imputation process and further allow the introduction of other data quality measures that imputation cannot handle. Our results suggest that explicitly accounting for reduced information quality with a modulating fully connected layer can enable the deployment of artificial intelligence systems in real-time applications.

Submitted 22 April, 2023; v1 submitted 18 July, 2021; originally announced July 2021.

Journal ref: Transactions on Machine Learning Research 2023

arXiv:2104.14685 [pdf, other]

Analysis of Manual and Automated Skin Tone Assignments for Face Recognition Applications

Authors: KS Krishnapriya, Michael C. King, Kevin W. Bowyer

Abstract: News reports have suggested that darker skin tone causes an increase in face recognition errors. The Fitzpatrick scale is widely used in dermatology to classify sensitivity to sun exposure and skin tone. In this paper, we analyze a set of manual Fitzpatrick skin type assignments and also employ the individual typology angle to automatically estimate the skin tone from face images. The set of manual skin tone rating experiments shows that there are inconsistencies between human raters that are difficult to eliminate. Efforts to automate skin tone rating suggest that it is particularly challenging on images collected without a calibration object in the scene. However, after the color-correction, the level of agreement between automated and manual approaches is found to be 96% or better for the MORPH images. To our knowledge, this is the first work to: (a) examine the consistency of manual skin tone ratings across observers, (b) document that there is substantial variation in the rating of the same image by different observers even when exemplar images are given for guidance and all images are color-corrected, and (c) compare manual versus automated skin tone ratings.

Submitted 29 April, 2021; originally announced April 2021.

arXiv:2104.13803 [pdf, other]

Does Face Recognition Error Echo Gender Classification Error?

Authors: Ying Qiu, Vítor Albiero, Michael C. King, Kevin W. Bowyer

Abstract: This paper is the first to explore the question of whether images that are classified incorrectly by a face analytics algorithm (e.g., gender classification) are any more or less likely to participate in an image pair that results in a face recognition error. We analyze results from three different gender classification algorithms (one open-source and two commercial), and two face recognition algorithms (one open-source and one commercial), on image sets representing four demographic groups (African-American female and male, Caucasian female and male). For impostor image pairs, our results show that pairs in which one image has a gender classification error have a better impostor distribution than pairs in which both images have correct gender classification, and so are less likely to generate a false match error. For genuine image pairs, our results show that individuals whose images have a mix of correct and incorrect gender classification have a worse genuine distribution (increased false non-match rate) compared to individuals whose images all have correct gender classification. Thus, compared to images that generate correct gender classification, images that generate gender classification errors do generate a different pattern of recognition errors, both better (false match) and worse (false non-match).

Submitted 28 April, 2021; originally announced April 2021.

arXiv:2103.08241 [pdf, other]

Reinforcement Learning with Algorithms from Probabilistic Structure Estimation

Authors: Jonathan P. Epperlein, Roman Overko, Sergiy Zhuk, Christopher King, Djallel Bouneffouf, Andrew Cullen, Robert Shorten

Abstract: Reinforcement learning (RL) algorithms aim to learn optimal decisions in unknown environments through experience of taking actions and observing the rewards gained. In some cases, the environment is not influenced by the actions of the RL agent, in which case the problem can be modeled as a contextual multi-armed bandit and lightweight myopic algorithms can be employed. On the other hand, when the RL agent's actions affect the environment, the problem must be modeled as a Markov decision process and more complex RL algorithms are required which take the future effects of actions into account. Moreover, in practice, it is often unknown from the outset whether or not the agent's actions will impact the environment and it is therefore not possible to determine which RL algorithm is most fitting. In this work, we propose to avoid this difficult decision entirely and incorporate a choice mechanism into our RL framework. Rather than assuming a specific problem structure, we use a probabilistic structure estimation procedure based on a likelihood-ratio (LR) test to make a more informed selection of learning algorithm. We derive a sufficient condition under which myopic policies are optimal, present an LR test for this condition, and derive a bound on the regret of our framework. We provide examples of real-world scenarios where our framework is needed and provide extensive simulations to validate our approach.

Submitted 1 June, 2022; v1 submitted 15 March, 2021; originally announced March 2021.

arXiv:2102.01740 [pdf, other]

Reliability Analysis of Artificial Intelligence Systems Using Recurrent Events Data from Autonomous Vehicles

Authors: Yili Hong, Jie Min, Caleb B. King, William Q. Meeker

Abstract: Artificial intelligence (AI) systems have become increasingly common and the trend will continue. Examples of AI systems include autonomous vehicles (AV), computer vision, natural language processing, and AI medical experts. To allow for safe and effective deployment of AI systems, the reliability of such systems needs to be assessed. Traditionally, reliability assessment is based on reliability test data and the subsequent statistical modeling and analysis. The availability of reliability data for AI systems, however, is limited because such data are typically sensitive and proprietary. The California Department of Motor Vehicles (DMV) oversees and regulates an AV testing program, in which many AV manufacturers are conducting AV road tests. Manufacturers participating in the program are required to report recurrent disengagement events to California DMV. This information is being made available to the public. In this paper, we use recurrent disengagement events as a representation of the reliability of the AI system in AV, and propose a statistical framework for modeling and analyzing the recurrent events data from AV driving tests. We use traditional parametric models in software reliability and propose a new nonparametric model based on monotonic splines to describe the event process. We develop inference procedures for selecting the best models, quantifying uncertainty, and testing heterogeneity in the event process. We then analyze the recurrent events data from four AV manufacturers, and make inferences on the reliability of the AI systems in AV. We also describe how the proposed analysis can be applied to assess the reliability of other AI systems.

Submitted 2 February, 2021; originally announced February 2021.

Comments: 30 pages, 9 figures

arXiv:2011.02036 [pdf, other]

(Un)fairness in Post-operative Complication Prediction Models

Authors: Sandhya Tripathi, Bradley A. Fritz, Mohamed Abdelhack, Michael S. Avidan, Yixin Chen, Christopher R. King

Abstract: With the current ongoing debate about fairness, explainability and transparency of machine learning models, their application in high-impact clinical decision-making systems must be scrutinized. We consider a real-life example of risk estimation before surgery and investigate the potential for bias or unfairness of a variety of algorithms. Our approach creates transparent documentation of potential bias so that the users can apply the model carefully. We augment a model-card like analysis using propensity scores with a decision-tree based guide for clinicians that would identify predictable shortcomings of the model. In addition to functioning as a guide for users, we propose that it can guide the algorithm development and informatics team to focus on data sources and structures that can address these shortcomings.

Submitted 3 November, 2020; originally announced November 2020.

arXiv:2010.12446 [pdf, other]

Estimation of Cardiac Valve Annuli Motion with Deep Learning

Authors: Eric Kerfoot, Carlos Escudero King, Tefvik Ismail, David Nordsletten, Renee Miller

Abstract: Valve annuli motion and morphology, measured from non-invasive imaging, can be used to gain a better understanding of healthy and pathological heart function. Measurements such as long-axis strain as well as peak strain rates provide markers of systolic function. Likewise, early and late-diastolic filling velocities are used as indicators of diastolic function. Quantifying global strains, however, requires a fast and precise method of tracking long-axis motion throughout the cardiac cycle. Valve landmarks such as the insertion of leaflets into the myocardial wall provide features that can be tracked to measure global long-axis motion. Feature tracking methods require initialisation, which can be time-consuming in studies with large cohorts. Therefore, this study developed and trained a neural network to identify ten features from unlabeled long-axis MR images: six mitral valve points from three long-axis views, two aortic valve points and two tricuspid valve points. This study used manual annotations of valve landmarks in standard 2-, 3- and 4-chamber long-axis images collected in clinical scans to train the network. The accuracy in the identification of these ten features, in pixel distance, was compared with the accuracy of two commonly used feature tracking methods as well as the inter-observer variability of manual annotations. Clinical measures, such as valve landmark strain and motion between end-diastole and end-systole, are also presented to illustrate the utility and robustness of the method.

Submitted 23 October, 2020; originally announced October 2020.

Comments: 10 pages, STACOM abstract

arXiv:2002.00065 [pdf, other]

Analysis of Gender Inequality In Face Recognition Accuracy

Authors: Vítor Albiero, Krishnapriya K. S., Kushal Vangara, Kai Zhang, Michael C. King, Kevin W. Bowyer

Abstract: We present a comprehensive analysis of how and why face recognition accuracy differs between men and women. We show that accuracy is lower for women due to the combination of (1) the impostor distribution for women having a skew toward higher similarity scores, and (2) the genuine distribution for women having a skew toward lower similarity scores. We show that this phenomenon of the impostor and genuine distributions for women shifting closer towards each other is general across datasets of African-American, Caucasian, and Asian faces. We show that the distribution of facial expressions may differ between male/female, but that the accuracy difference persists for image subsets rated confidently as neutral expression. The accuracy difference also persists for image subsets rated as close to zero pitch angle. Even when removing images with forehead partially occluded by hair/hat, the same impostor/genuine accuracy difference persists. We show that the female genuine distribution improves when only female images without facial cosmetics are used, but that the female impostor distribution also degrades at the same time. Lastly, we show that the accuracy difference persists even if a state-of-the-art deep learning method is trained from scratch using training data explicitly balanced between male and female images and subjects.

Submitted 31 January, 2020; originally announced February 2020.

Comments: Paper will appear at The 2nd Workshop on Demographic Variation in the Performance of Biometric Systems at WACV 2020

arXiv:1911.06396 [pdf, other]

Does Face Recognition Accuracy Get Better With Age? Deep Face Matchers Say No

Authors: Vítor Albiero, Kevin W. Bowyer, Kushal Vangara, Michael C. King

Abstract: Previous studies generally agree that face recognition accuracy is higher for older persons than for younger persons. But most previous studies were before the wave of deep learning matchers, and most considered accuracy only in terms of the verification rate for genuine pairs. This paper investigates accuracy for age groups 16-29, 30-49 and 50-70, using three modern deep CNN matchers, and considers differences in the impostor and genuine distributions as well as verification rates and ROC curves. We find that accuracy is lower for older persons and higher for younger persons. In contrast, a pre deep learning matcher on the same dataset shows the traditional result of higher accuracy for older persons, although its overall accuracy is much lower than that of the deep learning matchers. Comparing the impostor and genuine distributions, we conclude that impostor scores have a larger effect than genuine scores in causing lower accuracy for the older age group. We also investigate the effects of training data across the age groups. Our results show that fine-tuning the deep CNN models on additional images of older persons actually lowers accuracy for the older age group. Also, we fine-tune and train from scratch two models using age-balanced training datasets, and these results also show lower accuracy for older age group. These results argue that the lower accuracy for the older age group is not due to imbalance in the original training data.

Submitted 14 November, 2019; originally announced November 2019.

Comments: Paper will appear at the WACV 2020

arXiv:1907.12596 [pdf, other]

A Factored Generalized Additive Model for Clinical Decision Support in the Operating Room

Authors: Zhicheng Cui, Bradley A Fritz, Christopher R King, Michael S Avidan, Yixin Chen

Abstract: Logistic regression (LR) is widely used in clinical prediction because it is simple to deploy and easy to interpret. Nevertheless, being a linear model, LR has limited expressive capability and often has unsatisfactory performance. Generalized additive models (GAMs) extend the linear model with transformations of input features, though feature interaction is not allowed for all GAM variants. In this paper, we propose a factored generalized additive model (F-GAM) to preserve the model interpretability for targeted features while allowing a rich model for interaction with features fixed within the individual. We evaluate F-GAM on prediction of two targets, postoperative acute kidney injury and acute respiratory failure, from a single-center database. We find superior model performance of F-GAM in terms of AUPRC and AUROC compared to several other GAM implementations, random forests, support vector machine, and a deep neural network. We find that the model interpretability is good with results with high face validity.

Submitted 29 July, 2019; originally announced July 2019.

Comments: Accepted for publication in AMIA 2019 Annual Symposium

arXiv:1904.07325 [pdf]

Characterizing the Variability in Face Recognition Accuracy Relative to Race

Authors: KS Krishnapriya, Kushal Vangara, Michael C. King, Vitor Albiero, Kevin Bowyer

Abstract: Many recent news headlines have labeled face recognition technology as biased or racist. We report on a methodical investigation into differences in face recognition accuracy between African-American and Caucasian image cohorts of the MORPH dataset. We find that, for all four matchers considered, the impostor and the genuine distributions are statistically significantly different between cohorts. For a fixed decision threshold, the African-American image cohort has a higher false match rate and a lower false non-match rate. ROC curves compare verification rates at the same false match rate, but the different cohorts achieve the same false match rate at different thresholds. This means that ROC comparisons are not relevant to operational scenarios that use a fixed decision threshold. We show that, for the ResNet matcher, the two cohorts have approximately equal separation of impostor and genuine distributions. Using ICAO compliance as a standard of image quality, we find that the initial image cohorts have unequal rates of good quality images. The ICAO-compliant subsets of the original image cohorts show improved accuracy, with the main effect being to reduce the low-similarity tail of the genuine distributions.

Submitted 8 May, 2019; v1 submitted 15 April, 2019; originally announced April 2019.

Comments: Paper will appear in the BEFA workshop at CVPR 2019

arXiv:1904.00996 [pdf, other]

doi 10.1109/JIOT.2020.2983401

Distributed Ledger Technology for IoT: Parasite Chain Attacks

Authors: Andrew Cullen, Pietro Ferraro, Christopher King, Robert Shorten

Abstract: Directed Acyclic Graph (DAG) based Distributed Ledgers can be useful in a number of applications in the IoT domain. A distributed ledger should serve as an immutable and irreversible record of transactions; however, a DAG structure is a more complicated mathematical object than its blockchain counterparts, and as a result, providing guarantees of immutability and irreversibility is more involved. In this paper, we analyse a commonly discussed attack scenario known as a parasite chain attack for the IOTA Foundation DAG based ledger. We analyse the efficacy of the IOTA core MCMC algorithm using a matrix model and present an extension which improves the ledger's resistance to these attacks.

Submitted 10 November, 2020; v1 submitted 21 March, 2019; originally announced April 2019.

Journal ref: in IEEE Internet of Things Journal, vol. 7, no. 8, pp. 7112-7122, Aug. 2020

arXiv:1901.07302 [pdf, other]

doi 10.1109/TAC.2019.2950873

IOTA-based Directed Acyclic Graphs without Orphans

Authors: Pietro Ferraro, Christopher King, Robert Shorten

Abstract: Directed Acyclic Graphs (DAGs) are emerging as an attractive alternative to traditional blockchain architectures for distributed ledger technology (DLT). In particular, DAG ledgers with stochastic attachment mechanisms potentially offer many advantages over blockchain, including scalability and faster transaction speeds. However, the random nature of the attachment mechanism coupled with the requirement of protection against double-spend transactions leaves open the possibility that not all transactions will be eventually validated. Such transactions are said to be orphaned, and will never be validated. Our principal contribution is to propose a simple modification to the attachment mechanism for the Tangle (the IOTA DAG architecture). This modification ensures that all transactions are validated in finite time, and preserves essential features of the popular Monte-Carlo selection algorithm. In order to demonstrate these results we derive a fluid approximation for the Tangle (in the limit of infinite arrival rate) and prove that this fluid model exhibits the desired behavior. We also present simulations which validate the results for finite arrival rates.

Submitted 12 November, 2020; v1 submitted 12 December, 2018; originally announced January 2019.

Comments: This paper has been published with the title "On the stability of unverified transactions in a DAG-based Distributed Ledger"

Journal ref: in IEEE Transactions on Automatic Control, vol. 65, no. 9, pp. 3772-3783, Sept. 2020

arXiv:1807.00649 [pdf, other]

doi 10.1109/ACCESS.2018.2876766

Distributed Ledger Technology, Cyber-Physical Systems, and Social Compliance

Authors: Pietro Ferraro, Christopher King, Robert Shorten

Abstract: This paper describes how Distributed Ledger Technologies can be used to design a class of cyber-physical systems, as well as to enforce social contracts and to orchestrate the behaviour of agents trying to access a shared resource. The first part of the paper analyses the advantages and disadvantages of using Distributed Ledger Technologies architectures to implement certain control systems in an Internet of Things (IoT) setting, and then focuses on a specific type of DLT based on a Directed Acyclic Graph. In this setting we propose a set of delay differential equations to describe the dynamical behaviour of the Tangle, an IoT-inspired Directed Acyclic Graph designed for the cryptocurrency IOTA. The second part proposes an application of Distributed Ledger Technologies as a mechanism for dynamic deposit pricing, wherein the deposit of digital currency is used to orchestrate access to a network of shared resources. The pricing signal is used as a mechanism to enforce the desired level of compliance according to a predetermined set of rules. After presenting an illustrative example, we analyze the control system and provide sufficient conditions for the stability of the network.

Submitted 20 October, 2018; v1 submitted 2 July, 2018; originally announced July 2018.

Comments: This paper has been accepted for publication in the journal IEEE Access, with the title "Distributed Ledger Technology for Smart Cities, the Sharing Economy, and Social Compliance"

arXiv:1712.08348 [pdf, other]

Towards Software Development For Social Robotics Systems

Authors: Chong Sun, Jiongyan Zhang, Cong Liu, Barry Chew Bao King, Yuwei Zhang, Matthew Galle, Maria Spichkova

Abstract: In this paper we introduce the core results of the project on software development for social robotics systems. The usability of maintenance and control features is crucial for many kinds of systems, but in the case of social robotics we also have to take into account that (1) the humanoid robot physically interacts with humans, (2) the conversation with children might have different requirements in comparison to the conversation with adults. The results of our work were implemented for the humanoid PAL REEM robot, but their core ideas can be applied to other types of humanoid robots. We developed a web-based solution that supports the management of robot-guided tours, provides recommendations for the users, and allows for a visual analysis of the data on previous tours.

Submitted 22 December, 2017; originally announced December 2017.

arXiv:1708.03209 [pdf, ps, other]

doi 10.24963/ijcai.2017/37

Tosca: Operationalizing Commitments Over Information Protocols

Authors: Thomas C. King, Akın Günay, Amit K. Chopra, Munindar P. Singh

Abstract: The notion of commitment is widely studied as a high-level abstraction for modeling multiagent interaction. An important challenge is supporting flexible decentralized enactments of commitment specifications. In this paper, we combine recent advances on specifying commitments and information protocols. Specifically, we contribute Tosca, a technique for automatically synthesizing information protocols from commitment specifications. Our main result is that the synthesized protocols support commitment alignment, which is the idea that agents must make compatible inferences about their commitments despite decentralization.

Submitted 10 August, 2017; originally announced August 2017.

arXiv:1704.06654 [pdf, other]

doi 10.4233/uuid:82438672-3e8b-477a-a39e-0ce189639e88

Governing Governance: A Formal Framework for Analysing Institutional Design and Enactment Governance

Authors: Thomas C. King

Abstract: This dissertation is motivated by the need, in today's globalist world, for a precise way to enable governments, organisations and other regulatory bodies to evaluate the constraints they place on themselves and others. An organisation's modus operandi is enacting and fulfilling contracts between itself and its participants. Yet, organisational contracts should respect external laws, such as those setting out data privacy rights and liberties. Contracts can only be enacted by following contract law processes, which often require bilateral agreement and consideration. Governments need to legislate whilst understanding today's context of national and international governance hierarchy where law makers shun isolationism and seek to influence one another. Governments should avoid punishment by respecting constraints from international treaties and human rights charters. Governments can only enact legislation by following their own, pre-existing, law making procedures. In other words, institutions, such as laws and contracts, are designed and enacted under constraints.

Submitted 21 April, 2017; originally announced April 2017.

Journal ref: SIKS Dissertation Series No. 2016-41

arXiv:1212.0469 [pdf, ps, other]

Pushing the Communication Speed Limit of a Noninvasive BCI Speller

Authors: Po T. Wang, Christine E. King, An H. Do, Zoran Nenadic

Abstract: Electroencephalogram (EEG) based brain-computer interfaces (BCI) may provide a means of communication for those affected by severe paralysis. However, the relatively low information transfer rates (ITR) of these systems, currently limited to 1 bit/sec, present a serious obstacle to their widespread adoption in both clinical and non-clinical applications. Here, we report on the development of a novel noninvasive BCI communication system that achieves ITRs that are severalfold higher than those previously reported with similar systems. Using only 8 EEG channels, 6 healthy subjects with little to no prior BCI experience selected characters from a virtual keyboard with sustained, error-free, online ITRs in excess of 3 bit/sec. By factoring in the time spent to notify the subjects of their selection, practical, error-free typing rates as high as 12.75 characters/min were achieved, which allowed subjects to correctly type a 44-character sentence in less than 3.5 minutes. We hypothesize that ITRs can be further improved by optimizing the parameters of the interface, while practical typing rates can be significantly improved by shortening the selection notification time. These results provide compelling evidence that the ITR limit of noninvasive BCIs has not yet been reached and that further investigation into this matter is both justified and necessary.

Submitted 7 February, 2013; v1 submitted 3 December, 2012; originally announced December 2012.

arXiv:1210.1223 [pdf]

Further developments in generating type-safe messaging

Authors: R. Neswold, C. King

Abstract: At ICALEPCS '09, we introduced a source code generator that allows processes to communicate safely using data types native to each host language. In this paper, we discuss further development that has occurred since the conference in Kobe, Japan, including the addition of three more client languages, an optimization in network packet size and the addition of a new protocol data type.

Submitted 3 October, 2012; originally announced October 2012.

Comments: 4 pp. 13th International Conference on Accelerator and Large Experimental Physics Control Systems (ICALEPCS 2011). 10-14 Oct 2011. Grenoble, France

Report number: FERMILAB-CONF-11-594-AD

arXiv:1209.1859 [pdf, other]

Operation of a Brain-Computer Interface Walking Simulator by Users with Spinal Cord Injury

Authors: Christine E. King, Po T. Wang, Luis A. Chui, An H. Do, Zoran Nenadic

Abstract: Background: Spinal cord injury (SCI) can leave the affected individuals unable to ambulate. Since there are no restorative treatments for SCI, novel approaches such as brain-controlled prostheses have been sought. Our recent studies show that a brain-computer interface (BCI) can be used to control ambulation within a virtual reality environment (VRE), suggesting that a BCI-controlled lower extremity prosthesis for ambulation may be feasible. However, the operability of our BCI has not been tested in an SCI population. Methods: Five subjects with paraplegia or tetraplegia due to SCI underwent a 10-min training session in which they alternated between kinesthetic motor imagery (KMI) of idling and walking while their electroencephalogram (EEG) was recorded. Subjects then performed a goal-oriented online task, where they utilized KMI to control the linear ambulation of an avatar and make 10 sequential stops at designated points within the VRE. Multiple online trials were performed over 5 experimental days. Results: Classification accuracy of idling and walking was estimated offline and ranged from 60.5% (p=0.0176) to 92.3% (p=1.36*10^-20) across subjects and days. In the online task, all subjects achieved purposeful control with an average performance of 7.4 +/- 2.3 successful stops in 273 +/- 51 sec (p<0.01). All subjects maintained purposeful control throughout the study, and their online performances improved over time.
Conclusions: The results demonstrate that SCI subjects can purposefully operate a self-paced BCI walking simulator to complete a goal-oriented ambulation task. The operation of this BCI system requires short training, is intuitive, and is robust against subject-to-subject and day-to-day neurophysiological variations. These findings indicate that BCI-controlled lower extremity prostheses for gait rehabilitation or restoration after SCI may be feasible in the future.

Submitted 9 September, 2012; originally announced September 2012.

Comments: 17 pages, 7 figures, 5 tables, supplementary video link (http://www.youtube.com/watch?v=K4Frq9pwAz8)

arXiv:1208.6057 [pdf, ps, other]

Self-paced brain-computer interface control of ambulation in a virtual reality environment

Authors: Po T. Wang, Christine E. King, Luis A. Chui, An H. Do, Zoran Nenadic

Abstract: Objective: Spinal cord injury (SCI) often leaves affected individuals unable to ambulate. Electroencephalogramme (EEG) based brain-computer interface (BCI) controlled lower extremity prostheses may restore intuitive and able-body-like ambulation after SCI. To test its feasibility, the authors developed and tested a novel EEG-based, data-driven BCI system for intuitive and self-paced control of the ambulation of an avatar within a virtual reality environment (VRE). Approach: Eight able-bodied subjects and one with SCI underwent the following 10-min training session: subjects alternated between idling and walking kinaesthetic motor imageries (KMI) while their EEG were recorded and analysed to generate subject-specific decoding models. Subjects then performed a goal-oriented online task, repeated over 5 sessions, in which they utilised the KMI to control the linear ambulation of an avatar and make 10 sequential stops at designated points within the VRE. Main results: The average offline training performance across subjects was 77.2 +/- 9.5%, ranging from 64.3% (p = 0.00176) to 94.5% (p = 6.26*10^-23), with chance performance being 50%. The average online performance was 8.4 +/- 1.0 (out of 10) successful stops and 303 +/- 53 sec completion time (perfect = 211 sec). All subjects achieved performances significantly different than those of random walk (p < 0.05) in 44 of the 45 online sessions.
Significance: By using a data-driven machine learning approach to decode users' KMI, this BCIVRE system enabled intuitive and purposeful self-paced control of ambulation after only a 10-minute training. The ability to achieve such BCI control with minimal training indicates that the implementation of future BCI-lower extremity prosthesis systems may be feasible.

Submitted 29 August, 2012; originally announced August 2012.

Comments: 20 pages, 7 figures, link to video supplementary material (http://youtu.be/GXmovT3BxEo)

arXiv:1208.5024 [pdf, other]

Brain-Computer Interface Controlled Robotic Gait Orthosis

Authors: An H. Do, Po T. Wang, Christine E. King, Sophia N. Chun, Zoran Nenadic

Abstract: Reliance on wheelchairs after spinal cord injury (SCI) leads to many medical co-morbidities. Treatment of these conditions contributes to the majority of SCI health care costs. Restoring able-body-like ambulation after SCI may reduce the incidence of these conditions, and increase independence and quality of life. However, no biomedical solution exists that can reverse this lost neurological function, and hence novel methods are needed. Brain-computer interface (BCI) controlled lower extremity prosthesis may constitute one such novel approach. One able-bodied subject and one with paraplegia due to SCI underwent electroencephalogram (EEG) recording while engaged in alternating epochs of idling and walking kinesthetic motor imagery (KMI). These data were analyzed to generate an EEG prediction model for online BCI operation. A commercial robotic gait orthosis (RoGO) system (treadmill suspended) was interfaced with the BCI computer. In an online test, the subjects were tasked to ambulate using the BCI-RoGO system when prompted by computerized cues. The performance of this system was assessed with cross-correlation analysis, and omission and false alarm rates. The offline accuracy of the EEG prediction model averaged 86.3%. The cross-correlation between instructional cues and BCI-RoGO walking epochs averaged 0.812 +/- 0.048 (p-value<10^-4). There were on average 0.8 false alarms per session and no omissions.
This is the first time a person with paraplegia due to SCI regained basic brain-controlled ambulation, thereby indicating that restoring brain-controlled ambulation is feasible. Future work will test this system in a population of individuals with SCI. If successful, this may justify future development of invasive BCI-controlled lower extremity prostheses. This system may also be applied to incomplete SCI to improve neurological outcomes beyond those of standard physiotherapy.

Submitted 26 August, 2013; v1 submitted 24 August, 2012; originally announced August 2012.

Comments: Supplementary video (http://www.youtube.com/watch?v=W97Z8fEAQ7g and http://www.youtube.com/watch?v=HXNCwonhjG8)

Showing 1–44 of 44 results for author: King, C