Search | arXiv e-print repository

Anisotropic magnetoresistance in single cubic crystals: A theory and its verification

Authors: Yu Miao, Junwen Sun, Cunxu Gao, Desheng Xue, X. R. Wang

Abstract: A theory of anisotropic magnetoresistance (AMR) and planar Hall effect (PHE) in single cubic crystals and its experimental verifications are presented for the current in the (001) plane. In contrast to the general belief that AMR and PHE in single crystals are highly sensitive to many internal and external effects and have no universal features, the theory predicts universal angular dependencies o… ▽ More A theory of anisotropic magnetoresistance (AMR) and planar Hall effect (PHE) in single cubic crystals and its experimental verifications are presented for the current in the (001) plane. In contrast to the general belief that AMR and PHE in single crystals are highly sensitive to many internal and external effects and have no universal features, the theory predicts universal angular dependencies of longitudinal and transverse resistivity and various characteristics when magnetization rotates in the (001) plane, the plane perpendicular to the current, and the plane containing the current and [001] direction. The universal angular dependencies are verified by the experiments on Fe30Co70 single cubic crystal film. The findings provide new avenues for fundamental research and applications of AMR and PHE, because single crystals offer advantages over polycrystalline materials for band structure and crystallographic orientation engineering. △ Less

Submitted 30 November, 2023; originally announced December 2023.

arXiv:2311.15269 [pdf, other]

Tessel: Boosting Distributed Execution of Large DNN Models via Flexible Schedule Search

Authors: Zhiqi Lin, Youshan Miao, Guanbin Xu, Cheng Li, Olli Saarikivi, Saeed Maleki, Fan Yang

Abstract: Increasingly complex and diverse deep neural network (DNN) models necessitate distributing the execution across multiple devices for training and inference tasks, and also require carefully planned schedules for performance. However, existing practices often rely on predefined schedules that may not fully exploit the benefits of emerging diverse model-aware operator placement strategies. Handcraft… ▽ More Increasingly complex and diverse deep neural network (DNN) models necessitate distributing the execution across multiple devices for training and inference tasks, and also require carefully planned schedules for performance. However, existing practices often rely on predefined schedules that may not fully exploit the benefits of emerging diverse model-aware operator placement strategies. Handcrafting high-efficiency schedules can be challenging due to the large and varying schedule space. This paper presents Tessel, an automated system that searches for efficient schedules for distributed DNN training and inference for diverse operator placement strategies. To reduce search costs, Tessel leverages the insight that the most efficient schedules often exhibit repetitive pattern (repetend) across different data inputs. This leads to a two-phase approach: repetend construction and schedule completion. By exploring schedules for various operator placement strategies, Tessel significantly improves both training and inference performance. Experiments with representative DNN models demonstrate that Tessel achieves up to 5.5x training performance speedup and up to 38% inference latency reduction. △ Less

Submitted 26 November, 2023; originally announced November 2023.

Comments: The paper is accepted by HPCA 2024

arXiv:2311.12592 [pdf, other]

Visual tracking brain computer interface

Authors: Changxing Huang, Nanlin Shi, Yining Miao, Xiaogang Chen, Yijun Wang, Xiaorong Gao

Abstract: Brain-computer interfaces (BCIs) offer a way to interact with computers without relying on physical movements. Non-invasive electroencephalography (EEG)-based visual BCIs, known for efficient speed and calibration ease, face limitations in continuous tasks due to discrete stimulus design and decoding methods. To achieve continuous control, we implemented a novel spatial encoding stimulus paradigm… ▽ More Brain-computer interfaces (BCIs) offer a way to interact with computers without relying on physical movements. Non-invasive electroencephalography (EEG)-based visual BCIs, known for efficient speed and calibration ease, face limitations in continuous tasks due to discrete stimulus design and decoding methods. To achieve continuous control, we implemented a novel spatial encoding stimulus paradigm and devised a corresponding projection method to enable continuous modulation of decoded velocity. Subsequently, we conducted experiments involving 17 participants and achieved Fitt's ITR of 0.55 bps for the fixed tracking task and 0.37 bps for the random tracking task. The proposed BCI with a high Fitt's ITR was then integrated into two applications, including painting and gaming. In conclusion, this study proposed a visual BCI-based control method to go beyond discrete commands, allowing natural continuous control based on neural activity. △ Less

Submitted 21 November, 2023; originally announced November 2023.

arXiv:2311.11596 [pdf]

High-performance cVEP-BCI under minimal calibration

Authors: Yining Miao, Nanlin Shi, Changxing Huang, Yonghao Song, Xiaogang Chen, Yijun Wang, Xiaorong Gao

Abstract: The ultimate goal of brain-computer interfaces (BCIs) based on visual modulation paradigms is to achieve high-speed performance without the burden of extensive calibration. Code-modulated visual evoked potential-based BCIs (cVEP-BCIs) modulated by broadband white noise (WN) offer various advantages, including increased communication speed, expanded encoding target capabilities, and enhanced coding… ▽ More The ultimate goal of brain-computer interfaces (BCIs) based on visual modulation paradigms is to achieve high-speed performance without the burden of extensive calibration. Code-modulated visual evoked potential-based BCIs (cVEP-BCIs) modulated by broadband white noise (WN) offer various advantages, including increased communication speed, expanded encoding target capabilities, and enhanced coding flexibility. However, the complexity of the spatial-temporal patterns under broadband stimuli necessitates extensive calibration for effective target identification in cVEP-BCIs. Consequently, the information transfer rate (ITR) of cVEP-BCI under limited calibration usually stays around 100 bits per minute (bpm), significantly lagging behind state-of-the-art steady-state visual evoked potential-based BCIs (SSVEP-BCIs), which achieve rates above 200 bpm. To enhance the performance of cVEP-BCIs with minimal calibration, we devised an efficient calibration stage involving a brief single-target flickering, lasting less than a minute, to extract generalizable spatial-temporal patterns. Leveraging the calibration data, we developed two complementary methods to construct cVEP temporal patterns: the linear modeling method based on the stimulus sequence and the transfer learning techniques using cross-subject data. As a result, we achieved the highest ITR of 250 bpm under a minute of calibration, which has been shown to be comparable to the state-of-the-art SSVEP paradigms. In summary, our work significantly improved the cVEP performance under few-shot learning, which is expected to expand the practicality and usability of cVEP-BCIs. △ Less

Submitted 20 November, 2023; originally announced November 2023.

Comments: 35 pages, 5 figures

arXiv:2311.11275 [pdf, other]

Vital Signs Estimation Using a 26 GHz Multi-Beam Communication Testbed

Authors: Miquel Sellés Valls, Sofie Pollin, Ying Wang, Rizqi Hersyandika, Andre Kokkeler, Yang Miao

Abstract: This paper presents a novel pipeline for vital sign monitoring using a 26 GHz multi-beam communication testbed. In context of Joint Communication and Sensing (JCAS), the advanced communication capability at millimeter-wave bands is comparable to the radio resource of radars and is promising to sense the surrounding environment. Being able to communicate and sense the vital sign of humans present i… ▽ More This paper presents a novel pipeline for vital sign monitoring using a 26 GHz multi-beam communication testbed. In context of Joint Communication and Sensing (JCAS), the advanced communication capability at millimeter-wave bands is comparable to the radio resource of radars and is promising to sense the surrounding environment. Being able to communicate and sense the vital sign of humans present in the environment will enable new vertical services of telecommunication, i.e., remote health monitoring. The proposed processing pipeline leverages spatially orthogonal beams to estimate the vital sign - breath rate and heart rate - of single and multiple persons in static scenarios from the raw Channel State Information samples. We consider both monostatic and bistatic sensing scenarios. For monostatic scenario, we employ the phase time-frequency calibration and Discrete Wavelet Transform to improve the performance compared to the conventional Fast Fourier Transform based methods. For bistatic scenario, we use K-means clustering algorithm to extract multi-person vital signs due to the distinct frequency-domain signal feature between single and multi-person scenarios. The results show that the estimated breath rate and heart rate reach below 2 beats per minute (bpm) error compared to the reference captured by on-body sensor for the single-person monostatic sensing scenario with body-transceiver distance up to 2 m, and the two-person bistatic sensing scenario with BS-UE distance up to 4 m. The presented work does not optimize the OFDM waveform parameters for sensing; it demonstrates a promising JCAS proof-of-concept in contact-free vital sign monitoring using mmWave multi-beam communication systems. △ Less

Submitted 13 December, 2023; v1 submitted 19 November, 2023; originally announced November 2023.

arXiv:2311.07608 [pdf, other]

MuST: Multimodal Spatiotemporal Graph-Transformer for Hospital Readmission Prediction

Authors: Yan Miao, Lequan Yu

Abstract: Hospital readmission prediction is considered an essential approach to decreasing readmission rates, which is a key factor in assessing the quality and efficacy of a healthcare system. Previous studies have extensively utilized three primary modalities, namely electronic health records (EHR), medical images, and clinical notes, to predict hospital readmissions. However, the majority of these studi… ▽ More Hospital readmission prediction is considered an essential approach to decreasing readmission rates, which is a key factor in assessing the quality and efficacy of a healthcare system. Previous studies have extensively utilized three primary modalities, namely electronic health records (EHR), medical images, and clinical notes, to predict hospital readmissions. However, the majority of these studies did not integrate information from all three modalities or utilize the spatiotemporal relationships present in the dataset. This study introduces a novel model called the Multimodal Spatiotemporal Graph-Transformer (MuST) for predicting hospital readmissions. By employing Graph Convolution Networks and temporal transformers, we can effectively capture spatial and temporal dependencies in EHR and chest radiographs. We then propose a fusion transformer to combine the spatiotemporal features from the two modalities mentioned above with the features from clinical notes extracted by a pre-trained, domain-specific transformer. We assess the effectiveness of our methods using the latest publicly available dataset, MIMIC-IV. The experimental results indicate that the inclusion of multimodal features in MuST improves its performance in comparison to unimodal methods. Furthermore, our proposed pipeline outperforms the current leading methods in the prediction of hospital readmissions. △ Less

Submitted 11 November, 2023; originally announced November 2023.

arXiv:2311.06517 [pdf, other]

BClean: A Bayesian Data Cleaning System

Authors: Jianbin Qin, Sifan Huang, Yaoshu Wang, Jing Zhu, Yifan Zhang, Yukai Miao, Rui Mao, Makoto Onizuka, Chuan Xiao

Abstract: There is a considerable body of work on data cleaning which employs various principles to rectify erroneous data and transform a dirty dataset into a cleaner one. One of prevalent approaches is probabilistic methods, including Bayesian methods. However, existing probabilistic methods often assume a simplistic distribution (e.g., Gaussian distribution), which is frequently underfitted in practice,… ▽ More There is a considerable body of work on data cleaning which employs various principles to rectify erroneous data and transform a dirty dataset into a cleaner one. One of prevalent approaches is probabilistic methods, including Bayesian methods. However, existing probabilistic methods often assume a simplistic distribution (e.g., Gaussian distribution), which is frequently underfitted in practice, or they necessitate experts to provide a complex prior distribution (e.g., via a programming language). This requirement is both labor-intensive and costly, rendering these methods less suitable for real-world applications. In this paper, we propose BClean, a Bayesian Cleaning system that features automatic Bayesian network construction and user interaction. We recast the data cleaning problem as a Bayesian inference that fully exploits the relationships between attributes in the observed dataset and any prior information provided by users. To this end, we present an automatic Bayesian network construction method that extends a structure learning-based functional dependency discovery method with similarity functions to capture the relationships between attributes. Furthermore, our system allows users to modify the generated Bayesian network in order to specify prior information or correct inaccuracies identified by the automatic generation process. We also design an effective scoring model (called the compensative scoring model) necessary for the Bayesian inference. To enhance the efficiency of data cleaning, we propose several approximation strategies for the Bayesian inference, including graph partitioning, domain pruning, and pre-detection. By evaluating on both real-world and synthetic datasets, we demonstrate that BClean is capable of achieving an F-measure of up to 0.9 in data cleaning, outperforming existing Bayesian methods by 2% and other data cleaning methods by 15%. △ Less

Submitted 11 November, 2023; originally announced November 2023.

Comments: Our source code is available at https://github.com/yyssl88/BClean

arXiv:2310.14966 [pdf, other]

doi 10.21468/SciPostPhys.16.5.129

Rational Q-systems at Root of Unity I. Closed Chains

Authors: Jue Hou, Yunfeng Jiang, Yuan Miao

Abstract: The solution of Bethe ansatz equations for XXZ spin chain with the parameter $q$ being a root of unity is infamously subtle. In this work, we develop the rational $Q$-system for this case, which offers a systematic way to find all physical solutions of the Bethe ansatz equations at root of unity. The construction contains two parts. In the first part, we impose additional constraints to the ration… ▽ More The solution of Bethe ansatz equations for XXZ spin chain with the parameter $q$ being a root of unity is infamously subtle. In this work, we develop the rational $Q$-system for this case, which offers a systematic way to find all physical solutions of the Bethe ansatz equations at root of unity. The construction contains two parts. In the first part, we impose additional constraints to the rational $Q$-system. These constraints eliminate the so-called Fabricius-McCoy (FM) string solutions, yielding all primitive solutions. In the second part, we give a simple procedure to construct the descendant tower of any given primitive state. The primitive solutions together with their descendant towers constitute the complete Hilbert space. We test our proposal by extensive numerical checks and apply it to compute the torus partition function of the 6-vertex model at root of unity. △ Less

Submitted 4 April, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

Comments: 41 pages, 6 figures

Journal ref: SciPost Phys. 16, 129 (2024)

arXiv:2310.02434 [pdf, other]

Subrelativistic Alternating Phase Focusing Dielectric Laser Accelerators

Authors: Payton Broaddus, Thilo Egenolf, Dylan S. Black, Melanie Murillo, Clarisse Woodahl, Yu Miao, Uwe Niedermayer, Robert L. Byer, Kenneth J. Leedle, Olav Solgaard

Abstract: We demonstrate a silicon-based electron accelerator that uses laser optical near fields to both accelerate and confine electrons over extended distances. Two dielectric laser accelerator (DLA) designs were tested, each consisting of two arrays of silicon pillars pumped symmetrically by pulse front tilted laser beams, designed for average acceleration gradients 35 and 50 MeV/m respectively. The DLA… ▽ More We demonstrate a silicon-based electron accelerator that uses laser optical near fields to both accelerate and confine electrons over extended distances. Two dielectric laser accelerator (DLA) designs were tested, each consisting of two arrays of silicon pillars pumped symmetrically by pulse front tilted laser beams, designed for average acceleration gradients 35 and 50 MeV/m respectively. The DLAs are designed to act as alternating phase focusing (APF) lattices, where electrons, depending on the electron-laser interaction phase, will alternate between opposing longitudinal and transverse focusing and defocusing forces. By incorporating fractional period drift sections that alter the synchronous phase between $\pm 60^\circ$ off crest, electrons captured in the designed acceleration bucket experience half the peak gradient as average gradient while also experiencing strong confinement forces that enable long interaction lengths. We demonstrate APF accelerators with interaction lengths up to 708 $μ$m and energy gains up to 23.7 $\pm$ 1.07 keV FWHM, a 25$\%$ increase from starting energy, demonstrating the ability to achieve substantial energy gains with subrelativistic DLA. △ Less

Submitted 12 March, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

Comments: 16 pages

arXiv:2310.00253 [pdf, other]

doi 10.1088/1361-6382/ad6129

Scalar fields around a rotating loop quantum gravity black hole: Waveform, quasi-normal modes and superradiance

Authors: Zhong-Wu Xia, Hao Yang, Yan-Gang Miao

Abstract: The rotating loop quantum gravity black hole is a newly proposed non-singular black hole, which eliminates spacetime singularities when a regularization parameter is introduced through loop quantum corrections. This parameter is expected to give rise to observable effects. In this paper, the dynamical behavior of a scalar field near a rotating loop quantum gravity black hole is investigated. Given… ▽ More The rotating loop quantum gravity black hole is a newly proposed non-singular black hole, which eliminates spacetime singularities when a regularization parameter is introduced through loop quantum corrections. This parameter is expected to give rise to observable effects. In this paper, the dynamical behavior of a scalar field near a rotating loop quantum gravity black hole is investigated. Given a small initial perturbation, we obtain the waveform of massless scalar fields evolving over time. By analyzing the waveform, we find that the regularization parameter only affects the damping oscillation of waveform, but not the initial outburst and late-time tail stages. This behavior is characterized by quasi-normal modes. Under scalar field perturbations, the loop quantum black holes remain stable. Moreover, we calculate the quasi-normal modes of massive scalar fields by three numerical methods, which are the Prony, WKB, and shooting methods, respectively. Our results indicate that the real part of quasi-normal modes depends only on the regularization parameter, while the imaginary part does not only on the regularization parameter but also on the angular momentum. Finally, we study the amplification effect of rotating black holes, i.e., the superradiance. Our analyses indicate the existence of stronger superradiance around loop quantum gravity black holes compared to Kerr ones. △ Less

Submitted 9 July, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

Comments: v1: 24 pages, 6 figures, 2 tables; v2: 26 pages, clarifications and references added, final version to appear in Classical and Quantum Gravity

Journal ref: Class. Quantum Grav. 41 (2024) 165010 (22 pages)

arXiv:2309.14737 [pdf, other]

Volumetric Semantically Consistent 3D Panoptic Mapping

Authors: Yang Miao, Iro Armeni, Marc Pollefeys, Daniel Barath

Abstract: We introduce an online 2D-to-3D semantic instance mapping algorithm aimed at generating comprehensive, accurate, and efficient semantic 3D maps suitable for autonomous agents in unstructured environments. The proposed approach is based on a Voxel-TSDF representation used in recent algorithms. It introduces novel ways of integrating semantic prediction confidence during mapping, producing semantic… ▽ More We introduce an online 2D-to-3D semantic instance mapping algorithm aimed at generating comprehensive, accurate, and efficient semantic 3D maps suitable for autonomous agents in unstructured environments. The proposed approach is based on a Voxel-TSDF representation used in recent algorithms. It introduces novel ways of integrating semantic prediction confidence during mapping, producing semantic and instance-consistent 3D regions. Further improvements are achieved by graph optimization-based semantic labeling and instance refinement. The proposed method achieves accuracy superior to the state of the art on public large-scale datasets, improving on a number of widely used metrics. We also highlight a downfall in the evaluation of recent studies: using the ground truth trajectory as input instead of a SLAM-estimated one substantially affects the accuracy, creating a large gap between the reported results and the actual performance on real-world data. △ Less

Submitted 8 July, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

Comments: 8 pages, 2 figures

arXiv:2309.10895 [pdf, ps, other]

Large Language Models as Agents in the Clinic

Authors: Nikita Mehandru, Brenda Y. Miao, Eduardo Rodriguez Almaraz, Madhumita Sushil, Atul J. Butte, Ahmed Alaa

Abstract: Recent developments in large language models (LLMs) have unlocked new opportunities for healthcare, from information synthesis to clinical decision support. These new LLMs are not just capable of modeling language, but can also act as intelligent "agents" that interact with stakeholders in open-ended conversations and even influence clinical decision-making. Rather than relying on benchmarks that… ▽ More Recent developments in large language models (LLMs) have unlocked new opportunities for healthcare, from information synthesis to clinical decision support. These new LLMs are not just capable of modeling language, but can also act as intelligent "agents" that interact with stakeholders in open-ended conversations and even influence clinical decision-making. Rather than relying on benchmarks that measure a model's ability to process clinical data or answer standardized test questions, LLM agents should be assessed for their performance on real-world clinical tasks. These new evaluation frameworks, which we call "Artificial-intelligence Structured Clinical Examinations" ("AI-SCI"), can draw from comparable technologies where machines operate with varying degrees of self-governance, such as self-driving cars. High-fidelity simulations may also be used to evaluate interactions between users and LLMs within a clinical workflow, or to model the dynamic interactions of multiple LLMs. Developing these robust, real-world clinical evaluations will be crucial towards deploying LLM agents into healthcare. △ Less

Submitted 19 September, 2023; originally announced September 2023.

Comments: 4 pages

arXiv:2309.05557 [pdf, other]

An Empirical Study of NetOps Capability of Pre-Trained Large Language Models

Authors: Yukai Miao, Yu Bai, Li Chen, Dan Li, Haifeng Sun, Xizheng Wang, Ziqiu Luo, Yanyu Ren, Dapeng Sun, Xiuting Xu, Qi Zhang, Chao Xiang, Xinchi Li

Abstract: Nowadays, the versatile capabilities of Pre-trained Large Language Models (LLMs) have attracted much attention from the industry. However, some vertical domains are more interested in the in-domain capabilities of LLMs. For the Networks domain, we present NetEval, an evaluation set for measuring the comprehensive capabilities of LLMs in Network Operations (NetOps). NetEval is designed for evaluati… ▽ More Nowadays, the versatile capabilities of Pre-trained Large Language Models (LLMs) have attracted much attention from the industry. However, some vertical domains are more interested in the in-domain capabilities of LLMs. For the Networks domain, we present NetEval, an evaluation set for measuring the comprehensive capabilities of LLMs in Network Operations (NetOps). NetEval is designed for evaluating the commonsense knowledge and inference ability in NetOps in a multi-lingual context. NetEval consists of 5,732 questions about NetOps, covering five different sub-domains of NetOps. With NetEval, we systematically evaluate the NetOps capability of 26 publicly available LLMs. The results show that only GPT-4 can achieve a performance competitive to humans. However, some open models like LLaMA 2 demonstrate significant potential. △ Less

Submitted 19 September, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

arXiv:2309.05028 [pdf, other]

SC-NeRF: Self-Correcting Neural Radiance Field with Sparse Views

Authors: Liang Song, Guangming Wang, Jiuming Liu, Zhenyang Fu, Yanzi Miao, Hesheng

Abstract: In recent studies, the generalization of neural radiance fields for novel view synthesis task has been widely explored. However, existing methods are limited to objects and indoor scenes. In this work, we extend the generalization task to outdoor scenes, trained only on object-level datasets. This approach presents two challenges. Firstly, the significant distributional shift between training and… ▽ More In recent studies, the generalization of neural radiance fields for novel view synthesis task has been widely explored. However, existing methods are limited to objects and indoor scenes. In this work, we extend the generalization task to outdoor scenes, trained only on object-level datasets. This approach presents two challenges. Firstly, the significant distributional shift between training and testing scenes leads to black artifacts in rendering results. Secondly, viewpoint changes in outdoor scenes cause ghosting or missing regions in rendered images. To address these challenges, we propose a geometric correction module and an appearance correction module based on multi-head attention mechanisms. We normalize rendered depth and combine it with light direction as query in the attention mechanism. Our network effectively corrects varying scene structures and geometric features in outdoor scenes, generalizing well from object-level to unseen outdoor scenes. Additionally, we use appearance correction module to correct appearance features, preventing rendering artifacts like blank borders and ghosting due to viewpoint changes. By combining these modules, our approach successfully tackles the challenges of outdoor scene generalization, producing high-quality rendering results. When evaluated on four datasets (Blender, DTU, LLFF, Spaces), our network outperforms previous methods. Notably, compared to MVSNeRF, our network improves average PSNR from 19.369 to 25.989, SSIM from 0.838 to 0.889, and reduces LPIPS from 0.265 to 0.224 on Spaces outdoor scenes. △ Less

Submitted 10 September, 2023; originally announced September 2023.

arXiv:2308.13232 [pdf, other]

Estimating and approaching maximum information rate of noninvasive visual brain-computer interface

Authors: Nanlin Shi, Yining Miao, Changxing Huang, Xiang Li, Yonghao Song, Xiaogang Chen, Yijun Wang, Xiaorong Gao

Abstract: The mission of visual brain-computer interfaces (BCIs) is to enhance information transfer rate (ITR) to reach high speed towards real-life communication. Despite notable progress, noninvasive visual BCIs have encountered a plateau in ITRs, leaving it uncertain whether higher ITRs are achievable. In this study, we investigate the information rate limits of the primary visual channel to explore whet… ▽ More The mission of visual brain-computer interfaces (BCIs) is to enhance information transfer rate (ITR) to reach high speed towards real-life communication. Despite notable progress, noninvasive visual BCIs have encountered a plateau in ITRs, leaving it uncertain whether higher ITRs are achievable. In this study, we investigate the information rate limits of the primary visual channel to explore whether we can and how we should build visual BCI with higher information rate. Using information theory, we estimate a maximum achievable ITR of approximately 63 bits per second (bps) with a uniformly-distributed White Noise (WN) stimulus. Based on this discovery, we propose a broadband WN BCI approach that expands the utilization of stimulus bandwidth, in contrast to the current state-of-the-art visual BCI methods based on steady-state visual evoked potentials (SSVEPs). Through experimental validation, our broadband BCI outperforms the SSVEP BCI by an impressive margin of 7 bps, setting a new record of 50 bps. This achievement demonstrates the possibility of decoding 40 classes of noninvasive neural responses within a short duration of only 0.1 seconds. The information-theoretical framework introduced in this study provides valuable insights applicable to all sensory-evoked BCIs, making a significant step towards the development of next-generation human-machine interaction systems. △ Less

Submitted 25 August, 2023; originally announced August 2023.

arXiv:2308.05479 [pdf, ps, other]

doi 10.1038/s41598-023-40063-2

30-min Decayless Kink Oscillations in a Very Long Bundle of Solar Coronal Plasma Loops

Authors: Sihui Zhong, Valery M. Nakariakov, Yuhu Miao, Libo Fu, Ding Yuan

Abstract: The energy balance in the corona of the Sun is the key to the long-standing coronal heating dilemma, which could be potentially revealed by observational studies of decayless kink oscillations of coronal plasma loops. A bundle of very long off-limb coronal loops with the length of $736\pm80$ Mm and a lifetime of about 2 days are found to exhibit decayless kink oscillations. The oscillations were o… ▽ More The energy balance in the corona of the Sun is the key to the long-standing coronal heating dilemma, which could be potentially revealed by observational studies of decayless kink oscillations of coronal plasma loops. A bundle of very long off-limb coronal loops with the length of $736\pm80$ Mm and a lifetime of about 2 days are found to exhibit decayless kink oscillations. The oscillations were observed for several hours. The oscillation amplitude was measured at 0.3-0.5 Mm, and the period at 28-33 min. The existence of 30-min periodicity of decayless kink oscillations indicates that the mechanism compensating the wave damping is still valid in such a massive plasma structure. It provides important evidence for the non-resonant origin of decayless kink oscillations with 2-6min periods, i.e., the lack of their link with the leakage of photospheric and chromospheric oscillations into the corona and the likely role of the broadband energy sources. Magnetohydrodynamic seismology based on the reported detection of the kink oscillation, with the assistance of the differential emission measure analysis and a background coronal model provides us with a comprehensive set of plasma and magnetic field diagnostics, which is of interest as input parameters of space weather models. △ Less

Submitted 10 August, 2023; originally announced August 2023.

Comments: 15 pages, 5 figures, accepted by Scientific Reports

arXiv:2308.03853 [pdf]

doi 10.1056/AIdbp2300110

CORAL: Expert-Curated medical Oncology Reports to Advance Language Model Inference

Authors: Madhumita Sushil, Vanessa E. Kennedy, Divneet Mandair, Brenda Y. Miao, Travis Zack, Atul J. Butte

Abstract: Both medical care and observational studies in oncology require a thorough understanding of a patient's disease progression and treatment history, often elaborately documented in clinical notes. Despite their vital role, no current oncology information representation and annotation schema fully encapsulates the diversity of information recorded within these notes. Although large language models (L… ▽ More Both medical care and observational studies in oncology require a thorough understanding of a patient's disease progression and treatment history, often elaborately documented in clinical notes. Despite their vital role, no current oncology information representation and annotation schema fully encapsulates the diversity of information recorded within these notes. Although large language models (LLMs) have recently exhibited impressive performance on various medical natural language processing tasks, due to the current lack of comprehensively annotated oncology datasets, an extensive evaluation of LLMs in extracting and reasoning with the complex rhetoric in oncology notes remains understudied. We developed a detailed schema for annotating textual oncology information, encompassing patient characteristics, tumor characteristics, tests, treatments, and temporality. Using a corpus of 40 de-identified breast and pancreatic cancer progress notes at University of California, San Francisco, we applied this schema to assess the zero-shot abilities of three recent LLMs (GPT-4, GPT-3.5-turbo, and FLAN-UL2) to extract detailed oncological history from two narrative sections of clinical progress notes. Our team annotated 9028 entities, 9986 modifiers, and 5312 relationships. The GPT-4 model exhibited overall best performance, with an average BLEU score of 0.73, an average ROUGE score of 0.72, an exact-match F1-score of 0.51, and an average accuracy of 68% on complex tasks (expert manual evaluation on subset). Notably, it was proficient in tumor characteristic and medication extraction, and demonstrated superior performance in relational inference like adverse event detection. However, further improvements are needed before using it to reliably extract important facts from cancer progress notes needed for clinical research, complex population management, and documenting quality patient care. △ Less

Submitted 11 January, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

Comments: Source code available at: https://github.com/MadhumitaSushil/OncLLMExtraction

arXiv:2308.03068 [pdf, other]

doi 10.1088/1674-1137/ad34c1

Preliminary analyses on dynamics and thermodynamics of rotating regular black holes

Authors: Hao Yang, Chang-Jiang Yu, Yan-Gang Miao

Abstract: We investigate the dynamic and thermodynamic laws governing rotating regular black holes. By analyzing dynamic properties, i.e., the interaction between scalar particles and rotating regular black holes, we establish the criteria that determine whether such black holes satisfy the laws of thermodynamics or not. In addition, we provide the general form of conserved quantities related to rotating re… ▽ More We investigate the dynamic and thermodynamic laws governing rotating regular black holes. By analyzing dynamic properties, i.e., the interaction between scalar particles and rotating regular black holes, we establish the criteria that determine whether such black holes satisfy the laws of thermodynamics or not. In addition, we provide the general form of conserved quantities related to rotating regular black holes, including the relevant flows associated with neutral scalar particles. Meanwhile, we reexamine the relationship between the third law of thermodynamics and weak cosmic censorship conjecture for rotating regular black holes. In accordance with the criteria mentioned above, we discuss the laws of thermodynamics for three models of rotating regular black holes: Rotating Hayward black holes, Kerr black-bounce solutions, and loop quantum gravity black holes. Our findings indicate that none of the three models satisfies the first law of thermodynamics. In particular, the first and third models fail to comply with the three laws of thermodynamics, while the second model satisfies only the second and third laws of thermodynamics. Finally, we attempt to rescue the laws of thermodynamics by modifying entropy or extending phase space. However, the two scenarios are not able to ensure the three laws of thermodynamics in the three models, which reveals an unusual property of rotating regular black holes. △ Less

Submitted 17 May, 2024; v1 submitted 6 August, 2023; originally announced August 2023.

Comments: v1: 33 pages, 2 figures; v2: 37 pages, modifications and one reference added, final version to appear in Chinese Physics C

Journal ref: Chinese Physics C 48 (2024) 075101

arXiv:2308.01857 [pdf, other]

iEDA: An Open-Source Intelligent Physical Implementation Toolkit and Library

Authors: Xingquan Li, Simin Tao, Zengrong Huang, Shijian Chen, Zhisheng Zeng, Liwei Ni, Zhipeng Huang, Chunan Zhuang, Hongxi Wu, Weiguo Li1, Xueyan Zhao, He Liu, Shuaiying Long, Wei He, Bojun Liu, Sifeng Gan, Zihao Yu, Tong Liu, Yuchi Miao, Zhiyuan Yan, Hao Wang, Jie Zhao, Yifan Li, Ruizhi Liu, Xiaoze Lin , et al. (31 additional authors not shown)

Abstract: Open-source EDA shows promising potential in unleashing EDA innovation and lowering the cost of chip design. This paper presents an open-source EDA project, iEDA, aiming for building a basic infrastructure for EDA technology evolution and closing the industrial-academic gap in the EDA area. iEDA now covers the whole flow of physical design (including Floorplan, Placement, CTS, Routing, Timing Opti… ▽ More Open-source EDA shows promising potential in unleashing EDA innovation and lowering the cost of chip design. This paper presents an open-source EDA project, iEDA, aiming for building a basic infrastructure for EDA technology evolution and closing the industrial-academic gap in the EDA area. iEDA now covers the whole flow of physical design (including Floorplan, Placement, CTS, Routing, Timing Optimization etc.), and part of the analysis tools (Static Timing Analysis and Power Analysis). To demonstrate the effectiveness of iEDA, we implement and tape out three chips of different scales (from 700k to 1.5M gates) on different process nodes (110nm and 28nm) with iEDA. iEDA is publicly available from the project home page http://ieda.oscc.cc. △ Less

Submitted 3 August, 2023; originally announced August 2023.

arXiv:2307.01042 [pdf]

doi 10.1038/s41467-023-39500-7

A unique van Hove singularity in kagome superconductor CsV$_{3-x}$Ta$_x$Sb$_5$ with enhanced superconductivity

Authors: Yang Luo, Yulei Han, Jinjin Liu, Hui Chen, Zihao Huang, Linwei Huai, Hongyu Li, Bingqian Wang, Jianchang Shen, Shuhan Ding, Zeyu Li, Shuting Peng, Zhiyuan Wei, Yu Miao, Xiupeng Sun, Zhipeng Ou, Ziji Xiang, Makoto Hashimoto, Donghui Lu, Yugui Yao, Haitao Yang, Xianhui Chen, Hong-Jun Gao, Zhenhua Qiao, Zhiwei Wang , et al. (1 additional authors not shown)

Abstract: Van Hove singularity (VHS) has been considered as a driving source for unconventional superconductivity. A VHS in two-dimensional (2D) materials consists of a saddle point connecting electron-like and hole-like bands. In a rare case, when a VHS appears at Fermi level, both electron-like and hole-like conduction can coexist, giving rise to an enhanced density of states as well as an attractive comp… ▽ More Van Hove singularity (VHS) has been considered as a driving source for unconventional superconductivity. A VHS in two-dimensional (2D) materials consists of a saddle point connecting electron-like and hole-like bands. In a rare case, when a VHS appears at Fermi level, both electron-like and hole-like conduction can coexist, giving rise to an enhanced density of states as well as an attractive component of Coulomb interaction for unconventional electronic pairing. However, this van Hove scenario is often destroyed by an incorrect chemical potential or competing instabilities. Here, by using angle-resolved photoemission measurements, we report the observation of a VHS perfectly aligned with the Fermi level in a kagome superconductor CsV$_{3-x}$Ta$_x$Sb$_5$ (x~0.4), in which a record-high superconducting transition temperature is achieved among all the current variants of AV$_3$Sb$_5$ (A=Cs, Rb, K) at ambient pressure. Doping dependent measurements reveal the important role of van Hove scenario in boosting superconductivity, and spectroscopic-imaging scanning tunneling microscopy measurements indicate a distinct superconducting state in this system. △ Less

Submitted 3 July, 2023; originally announced July 2023.

Comments: 20 pages, 4 figures

Journal ref: Nature Communications 14, 3819 (2023)

arXiv:2307.00534 [pdf, other]

Shared Growth of Graph Neural Networks via Prompted Free-direction Knowledge Distillation

Authors: Kaituo Feng, Yikun Miao, Changsheng Li, Ye Yuan, Guoren Wang

Abstract: Knowledge distillation (KD) has shown to be effective to boost the performance of graph neural networks (GNNs), where the typical objective is to distill knowledge from a deeper teacher GNN into a shallower student GNN. However, it is often quite challenging to train a satisfactory deeper GNN due to the well-known over-parametrized and over-smoothing issues, leading to invalid knowledge transfer i… ▽ More Knowledge distillation (KD) has shown to be effective to boost the performance of graph neural networks (GNNs), where the typical objective is to distill knowledge from a deeper teacher GNN into a shallower student GNN. However, it is often quite challenging to train a satisfactory deeper GNN due to the well-known over-parametrized and over-smoothing issues, leading to invalid knowledge transfer in practical applications. In this paper, we propose the first Free-direction Knowledge Distillation framework via reinforcement learning for GNNs, called FreeKD, which is no longer required to provide a deeper well-optimized teacher GNN. Our core idea is to collaboratively learn two shallower GNNs to exchange knowledge between them. As we observe that one typical GNN model often exhibits better and worse performances at different nodes during training, we devise a dynamic and free-direction knowledge transfer strategy that involves two levels of actions: 1) node-level action determines the directions of knowledge transfer between the corresponding nodes of two networks; and then 2) structure-level action determines which of the local structures generated by the node-level actions to be propagated. Additionally, considering that different augmented graphs can potentially capture distinct perspectives of the graph data, we propose FreeKD-Prompt that learns undistorted and diverse augmentations based on prompt learning for exchanging varied knowledge. Furthermore, instead of confining knowledge exchange within two GNNs, we develop FreeKD++ to enable free-direction knowledge transfer among multiple GNNs. Extensive experiments on five benchmark datasets demonstrate our approaches outperform the base GNNs in a large margin. More surprisingly, our FreeKD has comparable or even better performance than traditional KD algorithms that distill knowledge from a deeper and stronger teacher GNN. △ Less

Submitted 16 November, 2023; v1 submitted 2 July, 2023; originally announced July 2023.

Comments: arXiv admin note: substantial text overlap with arXiv:2206.06561

arXiv:2306.12709 [pdf, other]

doi 10.1016/j.nuclphysb.2024.116491

Recovery of consistency in thermodynamics of regular black holes in Einstein's gravity coupled with nonlinear electrodynamics

Authors: Yang Guo, Hao Xie, Yan-Gang Miao

Abstract: As one of candidate theories in the construction of regular black holes, Einstein's gravity coupled with nonlinear electrodynamics has been a topic of great concerns. Owing to the coupling between Einstein's gravity and nonlinear electromagnetic fields, we need to reconsider the first law of thermodynamics, which will lead to a new thermodynamic phase space. In such a phase space, the equation of… ▽ More As one of candidate theories in the construction of regular black holes, Einstein's gravity coupled with nonlinear electrodynamics has been a topic of great concerns. Owing to the coupling between Einstein's gravity and nonlinear electromagnetic fields, we need to reconsider the first law of thermodynamics, which will lead to a new thermodynamic phase space. In such a phase space, the equation of state accurately describes the complete phase transition process of regular black holes. The Maxwell equal area law strictly holds when the phase transition occurs, and the entropy obeys the Bekenstein-Hawking area formula, which is compatible with the situation in Einstein's gravity. △ Less

Submitted 27 February, 2024; v1 submitted 22 June, 2023; originally announced June 2023.

Comments: v1: 18 pages, 7 figures; v2: 20 pages, clarifications and references added, final version to appear in Nuclear Physics B

Journal ref: Nucl. Phys. B 1000 (2024) 116491 (14 pages)

arXiv:2306.12113 [pdf, other]

Lightweight wood panel defect detection method incorporating attention mechanism and feature fusion network

Authors: Yongxin Cao, Fanghua Liu, Lai Jiang, Cheng Bao, You Miao, Yang Chen

Abstract: In recent years, deep learning has made significant progress in wood panel defect detection. However, there are still challenges such as low detection , slow detection speed, and difficulties in deploying embedded devices on wood panel surfaces. To overcome these issues, we propose a lightweight wood panel defect detection method called YOLOv5-LW, which incorporates attention mechanisms and a feat… ▽ More In recent years, deep learning has made significant progress in wood panel defect detection. However, there are still challenges such as low detection , slow detection speed, and difficulties in deploying embedded devices on wood panel surfaces. To overcome these issues, we propose a lightweight wood panel defect detection method called YOLOv5-LW, which incorporates attention mechanisms and a feature fusion network.Firstly, to enhance the detection capability of acceptable defects, we introduce the Multi-scale Bi-directional Feature Pyramid Network (MBiFPN) as a feature fusion network. The MBiFPN reduces feature loss, enriches local and detailed features, and improves the model's detection capability for acceptable defects.Secondly, to achieve a lightweight design, we reconstruct the ShuffleNetv2 network model as the backbone network. This reconstruction reduces the number of parameters and computational requirements while maintaining performance. We also introduce the Stem Block and Spatial Pyramid Pooling Fast (SPPF) models to compensate for any accuracy loss resulting from the lightweight design, ensuring the model's detection capabilities remain intact while being computationally efficient.Thirdly, we enhance the backbone network by incorporating Efficient Channel Attention (ECA), which improves the network's focus on key information relevant to defect detection. By attending to essential features, the model becomes more proficient in accurately identifying and localizing defects.We validate the proposed method using a self-developed wood panel defect dataset.The experimental results demonstrate the effectiveness of the improved YOLOv5-LW method. Compared to the original model, our approach achieves a 92.8\% accuracy rate, reduces the number of parameters by 27.78\%, compresses computational volume by 41.25\%, improves detection inference speed by 10.16\% △ Less

Submitted 21 June, 2023; originally announced June 2023.

arXiv:2306.11400 [pdf, other]

doi 10.1109/ICME55011.2023.00013

MuDPT: Multi-modal Deep-symphysis Prompt Tuning for Large Pre-trained Vision-Language Models

Authors: Yongzhu Miao, Shasha Li, Jintao Tang, Ting Wang

Abstract: Prompt tuning, like CoOp, has recently shown promising vision recognizing and transfer learning ability on various downstream tasks with the emergence of large pre-trained vision-language models like CLIP. However, we identify that existing uni-modal prompt tuning approaches may result in sub-optimal performance since this uni-modal design breaks the original alignment of textual and visual repres… ▽ More Prompt tuning, like CoOp, has recently shown promising vision recognizing and transfer learning ability on various downstream tasks with the emergence of large pre-trained vision-language models like CLIP. However, we identify that existing uni-modal prompt tuning approaches may result in sub-optimal performance since this uni-modal design breaks the original alignment of textual and visual representations in the pre-trained model. Inspired by the nature of pre-trained vision-language models, we aim to achieve completeness in prompt tuning and propose a novel approach called Multi-modal Deep-symphysis Prompt Tuning, dubbed as MuDPT, which extends independent multi-modal prompt tuning by additionally learning a model-agnostic transformative network to allow deep hierarchical bi-directional prompt fusion. We evaluate the effectiveness of MuDPT on few-shot vision recognition and out-of-domain generalization tasks. Compared with the state-of-the-art methods, MuDPT achieves better recognition and generalization ability with an apparent margin thanks to synergistic alignment of textual and visual representations. Our code is available at: https://github.com/Mechrev0/MuDPT. △ Less

Submitted 14 July, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

Comments: The paper has been accepted by ICME 2023

arXiv:2306.09792 [pdf, other]

GPINN: Physics-informed Neural Network with Graph Embedding

Authors: Yuyang Miao, Haolin Li

Abstract: This work proposes a Physics-informed Neural Network framework with Graph Embedding (GPINN) to perform PINN in graph, i.e. topological space instead of traditional Euclidean space, for improved problem-solving efficiency. The method integrates topological data into the neural network's computations, which significantly boosts the performance of the Physics-Informed Neural Network (PINN). The graph… ▽ More This work proposes a Physics-informed Neural Network framework with Graph Embedding (GPINN) to perform PINN in graph, i.e. topological space instead of traditional Euclidean space, for improved problem-solving efficiency. The method integrates topological data into the neural network's computations, which significantly boosts the performance of the Physics-Informed Neural Network (PINN). The graph embedding technique infuses extra dimensions into the input space to encapsulate the spatial characteristics of a graph while preserving the properties of the original space. The selection of these extra dimensions is guided by the Fiedler vector, offering an optimised pathologic notation of the graph. Two case studies are conducted, which demonstrate significant improvement in the performance of GPINN in comparison to traditional PINN, particularly in its superior ability to capture physical features of the solution. △ Less

Submitted 16 June, 2023; originally announced June 2023.

arXiv:2306.08423 [pdf, other]

DistSim: A performance model of large-scale hybrid distributed DNN training

Authors: Guandong Lu, Runzhe Chen, Yakai Wang, Yangjie Zhou, Rui Zhang, Zheng Hu, Yanming Miao, Zhifang Cai, Li Li, Jingwen Leng, Minyi Guo

Abstract: With the ever-increasing computational demand of DNN training workloads, distributed training has been widely adopted. A combination of data, model and pipeline parallelism strategy, called hybrid parallelism distributed training, is imported to tackle the problem of deploying large-scale models. However, how to evaluate the hybrid strategy and the utilization of each device remains a challenge si… ▽ More With the ever-increasing computational demand of DNN training workloads, distributed training has been widely adopted. A combination of data, model and pipeline parallelism strategy, called hybrid parallelism distributed training, is imported to tackle the problem of deploying large-scale models. However, how to evaluate the hybrid strategy and the utilization of each device remains a challenge since existing works either profile on a real large-scale cluster with high time and money costs or only analyze a specific type of parallelism without considering the hybrid parallelism. In this work, we proposed DistSim, an event-based performance model to accurately analyze each device's computation and communication activities with low profiling costs. DistDim breaks down the model into events according to the given distributed strategy, which can be profiled on two nodes. Then DistSim leverages the hierarchy of different parallel strategies to generate the computation and communication event-flow from layer level to model level and finally the activity timeline of each device participating in training. Experiment shows that DistSim can reach \revise{<4\%} errors when predicting distributing training batch time and \revise{<5\%} errors when predicting a single device's activity time in various hybrid strategy settings. We also provide a use-case of DistSim, automatically evaluate and search the best distributed training strategy, and find a hybrid strategy with at most $7.37\times$ throughput improvement. △ Less

Submitted 14 June, 2023; originally announced June 2023.

arXiv:2306.04362 [pdf, other]

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks

Authors: Haiyang Xu, Qinghao Ye, Xuan Wu, Ming Yan, Yuan Miao, Jiabo Ye, Guohai Xu, Anwen Hu, Yaya Shi, Guangwei Xu, Chenliang Li, Qi Qian, Maofei Que, Ji Zhang, Xiao Zeng, Fei Huang

Abstract: To promote the development of Vision-Language Pre-training (VLP) and multimodal Large Language Model (LLM) in the Chinese community, we firstly release the largest public Chinese high-quality video-language dataset named Youku-mPLUG, which is collected from Youku, a well-known Chinese video-sharing website, with strict criteria of safety, diversity, and quality. Youku-mPLUG contains 10 million Chi… ▽ More To promote the development of Vision-Language Pre-training (VLP) and multimodal Large Language Model (LLM) in the Chinese community, we firstly release the largest public Chinese high-quality video-language dataset named Youku-mPLUG, which is collected from Youku, a well-known Chinese video-sharing website, with strict criteria of safety, diversity, and quality. Youku-mPLUG contains 10 million Chinese video-text pairs filtered from 400 million raw videos across a wide range of 45 diverse categories for large-scale pre-training. In addition, to facilitate a comprehensive evaluation of video-language models, we carefully build the largest human-annotated Chinese benchmarks covering three popular video-language tasks of cross-modal retrieval, video captioning, and video category classification. Youku-mPLUG can enable researchers to conduct more in-depth multimodal research and develop better applications in the future. Furthermore, we release popular video-language pre-training models, ALPRO and mPLUG-2, and our proposed modularized decoder-only model mPLUG-video pre-trained on Youku-mPLUG. Experiments show that models pre-trained on Youku-mPLUG gain up to 23.1% improvement in video category classification. Besides, mPLUG-video achieves a new state-of-the-art result on these benchmarks with 80.5% top-1 accuracy in video category classification and 68.9 CIDEr score in video captioning, respectively. Finally, we scale up mPLUG-video based on the frozen Bloomz with only 1.7% trainable parameters as Chinese multimodal LLM, and demonstrate impressive instruction and video understanding ability. The zero-shot instruction understanding experiment indicates that pretraining with Youku-mPLUG can enhance the ability to comprehend overall and detailed visual semantics, recognize scene text, and leverage open-domain knowledge. △ Less

Submitted 7 June, 2023; originally announced June 2023.

Comments: Working in progress

arXiv:2306.04240 [pdf, other]

T-ADAF: Adaptive Data Augmentation Framework for Image Classification Network based on Tensor T-product Operator

Authors: Feiyang Han, Yun Miao, Zhaoyi Sun, Yimin Wei

Abstract: Image classification is one of the most fundamental tasks in Computer Vision. In practical applications, the datasets are usually not as abundant as those in the laboratory and simulation, which is always called as Data Hungry. How to extract the information of data more completely and effectively is very important. Therefore, an Adaptive Data Augmentation Framework based on the tensor T-product O… ▽ More Image classification is one of the most fundamental tasks in Computer Vision. In practical applications, the datasets are usually not as abundant as those in the laboratory and simulation, which is always called as Data Hungry. How to extract the information of data more completely and effectively is very important. Therefore, an Adaptive Data Augmentation Framework based on the tensor T-product Operator is proposed in this paper, to triple one image data to be trained and gain the result from all these three images together with only less than 0.1% increase in the number of parameters. At the same time, this framework serves the functions of column image embedding and global feature intersection, enabling the model to obtain information in not only spatial but frequency domain, and thus improving the prediction accuracy of the model. Numerical experiments have been designed for several models, and the results demonstrate the effectiveness of this adaptive framework. Numerical experiments show that our data augmentation framework can improve the performance of original neural network model by 2%, which provides competitive results to state-of-the-art methods. △ Less

Submitted 7 June, 2023; originally announced June 2023.

arXiv:2305.19982 [pdf, other]

Adam Accumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training

Authors: Yijia Zhang, Yibo Han, Shijie Cao, Guohao Dai, Youshan Miao, Ting Cao, Fan Yang, Ningyi Xu

Abstract: Running out of GPU memory has become a main bottleneck for large-scale DNN training. How to reduce the memory footprint during training has received intensive research attention. We find that previous gradient accumulation reduces activation memory but fails to be compatible with gradient memory reduction due to a contradiction between preserving gradients and releasing gradients. To address this… ▽ More Running out of GPU memory has become a main bottleneck for large-scale DNN training. How to reduce the memory footprint during training has received intensive research attention. We find that previous gradient accumulation reduces activation memory but fails to be compatible with gradient memory reduction due to a contradiction between preserving gradients and releasing gradients. To address this issue, we propose a novel optimizer accumulation method for Adam, named Adam Accumulation (AdamA), which enables reducing both activation and gradient memory. Specifically, AdamA directly integrates gradients into optimizer states and accumulates optimizer states over micro-batches, so that gradients can be released immediately after use. We mathematically and experimentally demonstrate AdamA yields the same convergence properties as Adam. Evaluated on transformer-based models, AdamA achieves up to 23% memory reduction compared to gradient accumulation with less than 2% degradation in training throughput. Notably, AdamA can work together with memory reduction methods for optimizer states to fit 1.26x~3.14x larger models over PyTorch and DeepSpeed baseline on GPUs with different memory capacities. △ Less

Submitted 31 May, 2023; originally announced May 2023.

arXiv:2305.16617 [pdf, other]

Efficient Detection of LLM-generated Texts with a Bayesian Surrogate Model

Authors: Yibo Miao, Hongcheng Gao, Hao Zhang, Zhijie Deng

Abstract: The detection of machine-generated text, especially from large language models (LLMs), is crucial in preventing serious social problems resulting from their misuse. Some methods train dedicated detectors on specific datasets but fall short in generalizing to unseen test data, while other zero-shot ones often yield suboptimal performance. Although the recent DetectGPT has shown promising detection… ▽ More The detection of machine-generated text, especially from large language models (LLMs), is crucial in preventing serious social problems resulting from their misuse. Some methods train dedicated detectors on specific datasets but fall short in generalizing to unseen test data, while other zero-shot ones often yield suboptimal performance. Although the recent DetectGPT has shown promising detection performance, it suffers from significant inefficiency issues, as detecting a single candidate requires querying the source LLM with hundreds of its perturbations. This paper aims to bridge this gap. Concretely, we propose to incorporate a Bayesian surrogate model, which allows us to select typical samples based on Bayesian uncertainty and interpolate scores from typical samples to other samples, to improve query efficiency. Empirical results demonstrate that our method significantly outperforms existing approaches under a low query budget. Notably, when detecting the text generated by LLaMA family models, our method with just 2 or 3 queries can outperform DetectGPT with 200 queries. △ Less

Submitted 4 June, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

arXiv:2305.14062 [pdf, other]

Amplitude-Independent Machine Learning for PPG through Visibility Graphs and Transfer Learning

Authors: Yuyang Miao, Harry J. Davies, Danilo P. Mandic

Abstract: Photoplethysmography (PPG) refers to the measurement of variations in blood volume using light and is a feature of most wearable devices. The PPG signals provide insight into the body's circulatory system and can be employed to extract various bio-features, such as heart rate and vascular ageing. Although several algorithms have been proposed for this purpose, many exhibit limitations, including h… ▽ More Photoplethysmography (PPG) refers to the measurement of variations in blood volume using light and is a feature of most wearable devices. The PPG signals provide insight into the body's circulatory system and can be employed to extract various bio-features, such as heart rate and vascular ageing. Although several algorithms have been proposed for this purpose, many exhibit limitations, including heavy reliance on human calibration, high signal quality requirements, and a lack of generalisation. In this paper, we introduce a PPG signal processing framework that integrates graph theory and computer vision algorithms, to provide an analysis framework which is amplitude-independent and invariant to affine transformations. It also requires minimal preprocessing, fuses information through RGB channels and exhibits robust generalisation across tasks and datasets. The proposed VGTL-net achieves state-of-the-art performance in the prediction of vascular ageing and demonstrates robust estimation of continuous blood pressure waveforms. △ Less

Submitted 16 January, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

arXiv:2305.12865 [pdf, other]

Automatic Code Summarization via ChatGPT: How Far Are We?

Authors: Weisong Sun, Chunrong Fang, Yudu You, Yun Miao, Yi Liu, Yuekang Li, Gelei Deng, Shenghan Huang, Yuchen Chen, Quanjun Zhang, Hanwei Qian, Yang Liu, Zhenyu Chen

Abstract: To support software developers in understanding and maintaining programs, various automatic code summarization techniques have been proposed to generate a concise natural language comment for a given code snippet. Recently, the emergence of large language models (LLMs) has led to a great boost in the performance of natural language processing tasks. Among them, ChatGPT is the most popular one whic… ▽ More To support software developers in understanding and maintaining programs, various automatic code summarization techniques have been proposed to generate a concise natural language comment for a given code snippet. Recently, the emergence of large language models (LLMs) has led to a great boost in the performance of natural language processing tasks. Among them, ChatGPT is the most popular one which has attracted wide attention from the software engineering community. However, it still remains unclear how ChatGPT performs in (automatic) code summarization. Therefore, in this paper, we focus on evaluating ChatGPT on a widely-used Python dataset called CSN-Python and comparing it with several state-of-the-art (SOTA) code summarization models. Specifically, we first explore an appropriate prompt to guide ChatGPT to generate in-distribution comments. Then, we use such a prompt to ask ChatGPT to generate comments for all code snippets in the CSN-Python test set. We adopt three widely-used metrics (including BLEU, METEOR, and ROUGE-L) to measure the quality of the comments generated by ChatGPT and SOTA models (including NCS, CodeBERT, and CodeT5). The experimental results show that in terms of BLEU and ROUGE-L, ChatGPT's code summarization performance is significantly worse than all three SOTA models. We also present some cases and discuss the advantages and disadvantages of ChatGPT in code summarization. Based on the findings, we outline several open challenges and opportunities in ChatGPT-based code summarization. △ Less

Submitted 22 May, 2023; originally announced May 2023.

MSC Class: 68T50 ACM Class: D.2.3

arXiv:2305.05663 [pdf, ps, other]

Proofs that the Gerber Statistic is Positive Semidefinite

Authors: S. Gerber, H. Markowitz, P. Ernst, Y. Miao, B. Javid, P. Sargen

Abstract: In this brief note, we prove that both forms of the Gerber statistic introduced in Gerber et al. (2022) are positive semi-definite. In this brief note, we prove that both forms of the Gerber statistic introduced in Gerber et al. (2022) are positive semi-definite. △ Less

Submitted 9 May, 2023; originally announced May 2023.

Comments: 5 pages, 0 figures

arXiv:2304.13624 [pdf, other]

doi 10.1103/PhysRevB.108.155102

Quantum many-body scars in spin models with multibody interactions

Authors: Kazuyuki Sanada, Yuan Miao, Hosho Katsura

Abstract: We introduce and study several classes of quantum spin models with multi-body interactions that exhibit quantum many-body scars. The models are constructed by two different methods: one exploiting boundary states in integrable spin chains and the other based on a variant of existing methods such as restricted spectrum generating algebras. The first method allows us to construct deformations of the… ▽ More We introduce and study several classes of quantum spin models with multi-body interactions that exhibit quantum many-body scars. The models are constructed by two different methods: one exploiting boundary states in integrable spin chains and the other based on a variant of existing methods such as restricted spectrum generating algebras. The first method allows us to construct deformations of the Majumdar-Ghosh and Affleck-Kennedy-Lieb-Tasaki models -- prototypes of frustration-free systems. With the second method, we construct a large class of spin-$1$ models involving scalar spin chirality in both one and two dimensions. Interestingly, in some cases, the models so constructed have towers of scar states of different character. For each example, we show that the scar states behave differently from thermal states by comparing their spectral and dynamical properties with those of other states. We also show that a superposition of the scar states constructed by the second method exhibits perfectly periodic revivals in the dynamics. △ Less

Submitted 3 October, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

Comments: 30 pages, 24 figures; v3: some discussions and references added, some typos corrected

Journal ref: Phys. Rev. B 108, 155102 (2023)

arXiv:2304.08741 [pdf, ps, other]

Ideal Secret Sharing Schemes: Combinatorial Characterizations, Certain Access Structures, and Related Geometric Problems

Authors: Ryoh Fuji-Hara, Ying Miao

Abstract: An ideal secret sharing scheme is a method of sharing a secret key in some key space among a finite set of participants in such a way that only the authorized subsets of participants can reconstruct the secret key from their shares which are of the same length as that of the secret key. The set of all authorized subsets of participants is the access structure of the secret sharing scheme. In this… ▽ More An ideal secret sharing scheme is a method of sharing a secret key in some key space among a finite set of participants in such a way that only the authorized subsets of participants can reconstruct the secret key from their shares which are of the same length as that of the secret key. The set of all authorized subsets of participants is the access structure of the secret sharing scheme. In this paper, we derive several properties and restate the combinatorial characterization of an ideal secret sharing scheme in Brickell-Stinson model in terms of orthogonality of its representative array. We propose two practical models, namely the parallel and hierarchical models, for access structures, and then, by the restated characterization, we discuss sufficient conditions on finite geometries for ideal secret sharing schemes to realize these access structure models. Several series of ideal secret sharing schemes realizing special parallel or hierarchical access structure model are constructed from finite projective planes. △ Less

Submitted 18 April, 2023; originally announced April 2023.

Comments: This paper was published in 2009 in the "Journal of Statistics and Applications Vol 4, No. 2-3", which is now inaccessible and has been removed from MathSciNet. I have decided to upload the paper here for those who wish to refer to it

arXiv:2304.07672 [pdf, other]

Optimal Investment and Consumption Strategies with General and Linear Transaction Costs under CRRA Utility

Authors: Yingting Miao, Qiang Zhang

Abstract: Transaction costs play a critical role in asset allocation and consumption strategies in portfolio management. We apply the methods of dynamic programming and singular perturbation expansion to derive the closed-form leading solutions to this problem for small transaction costs with arbitrary transaction cost structure by maximizing the expected CRRA (constant relative risk aversion) utility funct… ▽ More Transaction costs play a critical role in asset allocation and consumption strategies in portfolio management. We apply the methods of dynamic programming and singular perturbation expansion to derive the closed-form leading solutions to this problem for small transaction costs with arbitrary transaction cost structure by maximizing the expected CRRA (constant relative risk aversion) utility function for this problem. We also discuss in detail the case which consists of both fixed and proportional transaction costs. △ Less

Submitted 15 April, 2023; originally announced April 2023.

arXiv:2303.14562 [pdf, other]

Resolution Complete In-Place Object Retrieval given Known Object Models

Authors: Daniel Nakhimovich, Yinglong Miao, Kostas E. Bekris

Abstract: This work proposes a robot task planning framework for retrieving a target object in a confined workspace among multiple stacked objects that obstruct the target. The robot can use prehensile picking and in-workspace placing actions. The method assumes access to 3D models for the visible objects in the scene. The key contribution is in achieving desirable properties, i.e., to provide (a) safety, b… ▽ More This work proposes a robot task planning framework for retrieving a target object in a confined workspace among multiple stacked objects that obstruct the target. The robot can use prehensile picking and in-workspace placing actions. The method assumes access to 3D models for the visible objects in the scene. The key contribution is in achieving desirable properties, i.e., to provide (a) safety, by avoiding collisions with sensed obstacles, objects, and occluded regions, and (b) resolution completeness (RC) - or probabilistic completeness (PC) depending on implementation - which indicates a solution will be eventually found (if it exists) as the resolution of algorithmic parameters increases. A heuristic variant of the basic RC algorithm is also proposed to solve the task more efficiently while retaining the desirable properties. Simulation results compare using random picking and placing operations against the basic RC algorithm that reasons about object dependency as well as its heuristic variant. The success rate is higher for the RC approaches given the same amount of time. The heuristic variant is able to solve the problem even more efficiently than the basic approach. The integration of the RC algorithm with perception, where an RGB-D sensor detects the objects as they are being moved, enables real robot demonstrations of safely retrieving target objects from a cluttered shelf. △ Less

Submitted 25 March, 2023; originally announced March 2023.

Comments: 7 pages, 4 figures, Accepted to IEEE International Conference on Robotics and Automation (ICRA) 2023

arXiv:2303.11696 [pdf, other]

doi 10.1007/s10773-023-05454-1

Regular black holes: A short topic review

Authors: Chen Lan, Hao Yang, Yang Guo, Yan-Gang Miao

Abstract: The essential singularity in Einstein's gravity can be avoidable if the preconditions of Penrose's theorem can be bypassed, i.e., if the strong energy condition is broken in the vicinity of a black hole center. The singularity mentioned here includes two aspects: (i) the divergence of curvature invariants, and (ii) the incompleteness of geodesics. Both aspects are now taken into account in order t… ▽ More The essential singularity in Einstein's gravity can be avoidable if the preconditions of Penrose's theorem can be bypassed, i.e., if the strong energy condition is broken in the vicinity of a black hole center. The singularity mentioned here includes two aspects: (i) the divergence of curvature invariants, and (ii) the incompleteness of geodesics. Both aspects are now taken into account in order to determine whether a black hole contains essential singularities. In this sense, black holes without essential singularities are dubbed regular (non-singular) black holes. The regular black holes have some intriguing phenomena that are different from those of singular black holes, and such phenomena have inspired numerous studies. In this review, we summarize the current topics that are associated with regular black holes. △ Less

Submitted 5 September, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

Comments: Final version to appear in International Journal of Theoretical Physics. Major revision, 45 pages, 2 figures, some references have ben added

Journal ref: Int. J. Theor. Phys. 62, 202 (2023)

arXiv:2303.09400 [pdf, other]

Enhancing Vital Sign Estimation Performance of FMCW MIMO Radar by Prior Human Shape Recognition

Authors: Hadi Alidoustaghdam, Min Chen, Ben Willetts, Kai Mao, André Kokkeler, Yang Miao

Abstract: Radio technology enabled contact-free human posture and vital sign estimation is promising for health monitoring. Radio systems at millimeter-wave (mmWave) frequencies advantageously bring large bandwidth, multi-antenna array and beam steering capability. \textit{However}, the human point cloud obtained by mmWave radar and utilized for posture estimation is likely to be sparse and incomplete. Addi… ▽ More Radio technology enabled contact-free human posture and vital sign estimation is promising for health monitoring. Radio systems at millimeter-wave (mmWave) frequencies advantageously bring large bandwidth, multi-antenna array and beam steering capability. \textit{However}, the human point cloud obtained by mmWave radar and utilized for posture estimation is likely to be sparse and incomplete. Additionally, human's random body movements deteriorate the estimation of breathing and heart rates, therefore the information of the chest location and a narrow radar beam toward the chest are demanded for more accurate vital sign estimation. In this paper, we propose a pipeline aiming to enhance the vital sign estimation performance of mmWave FMCW MIMO radar. The first step is to recognize human body part and posture, where we exploit a trained Convolutional Neural Networks (CNN) to efficiently process the imperfect human form point cloud. The CNN framework outputs the key point of different body parts, and was trained by using RGB image reference and Augmentative Ellipse Fitting Algorithm (AEFA). The next step is to utilize the chest information of the prior estimated human posture for vital sign estimation. While CNN is initially trained based on the frame-by-frame point clouds of human for posture estimation, the vital signs are extracted through beamforming toward the human chest. The numerical results show that this spatial filtering improves the estimation of the vital signs in regard to lowering the level of side harmonics and detecting the harmonics of vital signs efficiently, i.e., peak-to-average power ratio in the harmonics of vital signal is improved up to 0.02 and 0.07dB for the studied cases. △ Less

Submitted 16 March, 2023; originally announced March 2023.

Comments: Accepted for presentation at the IEEE ICC 2023 conference

arXiv:2303.06682 [pdf, other]

DDS2M: Self-Supervised Denoising Diffusion Spatio-Spectral Model for Hyperspectral Image Restoration

Authors: Yuchun Miao, Lefei Zhang, Liangpei Zhang, Dacheng Tao

Abstract: Diffusion models have recently received a surge of interest due to their impressive performance for image restoration, especially in terms of noise robustness. However, existing diffusion-based methods are trained on a large amount of training data and perform very well in-distribution, but can be quite susceptible to distribution shift. This is especially inappropriate for data-starved hyperspect… ▽ More Diffusion models have recently received a surge of interest due to their impressive performance for image restoration, especially in terms of noise robustness. However, existing diffusion-based methods are trained on a large amount of training data and perform very well in-distribution, but can be quite susceptible to distribution shift. This is especially inappropriate for data-starved hyperspectral image (HSI) restoration. To tackle this problem, this work puts forth a self-supervised diffusion model for HSI restoration, namely Denoising Diffusion Spatio-Spectral Model (\texttt{DDS2M}), which works by inferring the parameters of the proposed Variational Spatio-Spectral Module (VS2M) during the reverse diffusion process, solely using the degraded HSI without any extra training data. In VS2M, a variational inference-based loss function is customized to enable the untrained spatial and spectral networks to learn the posterior distribution, which serves as the transitions of the sampling chain to help reverse the diffusion process. Benefiting from its self-supervised nature and the diffusion process, \texttt{DDS2M} enjoys stronger generalization ability to various HSIs compared to existing diffusion-based methods and superior robustness to noise compared to existing HSI restoration methods. Extensive experiments on HSI denoising, noisy HSI completion and super-resolution on a variety of HSIs demonstrate \texttt{DDS2M}'s superiority over the existing task-specific state-of-the-arts. △ Less

Submitted 19 March, 2023; v1 submitted 12 March, 2023; originally announced March 2023.

Comments: 11 pages, 5 figures

arXiv:2303.03931 [pdf, other]

doi 10.1140/epjc/s10052-023-12228-w

A regular black hole as the final state of evolution of a singular black hole

Authors: Han-Wen Hu, Chen Lan, Yan-Gang Miao

Abstract: We propose a novel black hole model in which singular and regular black holes are combined as a whole and more precisely singular and regular black holes are regarded as different states of parameter evolution. We refer to them as singular and regular states, respectively. Furthermore, the regular state is depicted by the final state of parameter evolution in the model. We also present the sources… ▽ More We propose a novel black hole model in which singular and regular black holes are combined as a whole and more precisely singular and regular black holes are regarded as different states of parameter evolution. We refer to them as singular and regular states, respectively. Furthermore, the regular state is depicted by the final state of parameter evolution in the model. We also present the sources that can generate such a black hole spacetime in the framework of $F(R)$ gravity. This theory of modified gravity is adopted because it offers a possible resolution to a tough issue in the thermodynamics of regular black holes, namely the discrepancy between the thermal entropy and Wald entropy. The dynamics and thermodynamics of the novel black hole model are also discussed when a singular state evolves into a regular state during the change of charge or horizon radius from its initial value to its extreme value. △ Less

Submitted 16 November, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

Comments: Final version to appear in the EPJC, 42 pages, 24 figures, references added

Journal ref: Eur. Phys. J. C 83, 1047 (2023)

arXiv:2302.12675 [pdf, other]

doi 10.22331/q-2023-11-03-1160

Integrable Quantum Circuits from the Star-Triangle Relation

Authors: Yuan Miao, Eric Vernier

Abstract: The star-triangle relation plays an important role in the realm of exactly solvable models, offering exact results for classical two-dimensional statistical mechanical models. In this article, we construct integrable quantum circuits using the star-triangle relation. Our construction relies on families of mutually commuting two-parameter transfer matrices for statistical mechanical models solved b… ▽ More The star-triangle relation plays an important role in the realm of exactly solvable models, offering exact results for classical two-dimensional statistical mechanical models. In this article, we construct integrable quantum circuits using the star-triangle relation. Our construction relies on families of mutually commuting two-parameter transfer matrices for statistical mechanical models solved by the star-triangle relation, and differs from previously known constructions based on Yang-Baxter integrable vertex models. At special value of the spectral parameter, the transfer matrices are mapped into integrable quantum circuits, for which infinite families of local conserved charges can be derived. We demonstrate the construction by giving two examples of circuits acting on a chain of $Q-$state qudits: $Q$-state Potts circuits, whose integrability has been conjectured recently by Lotkov et al., and $\mathbb{Z}_Q$ circuits, which are novel to our knowledge. In the first example, we present for $Q=3$ a connection to the Zamolodchikov-Fateev 19-vertex model. △ Less

Submitted 25 October, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

Comments: 25 pages, 9 figures. Accepted by Quantum

Journal ref: Quantum 7, 1160 (2023)

arXiv:2302.05433 [pdf, other]

Unified Functional Hashing in Automatic Machine Learning

Authors: Ryan Gillard, Stephen Jonany, Yingjie Miao, Michael Munn, Connal de Souza, Jonathan Dungay, Chen Liang, David R. So, Quoc V. Le, Esteban Real

Abstract: The field of Automatic Machine Learning (AutoML) has recently attained impressive results, including the discovery of state-of-the-art machine learning solutions, such as neural image classifiers. This is often done by applying an evolutionary search method, which samples multiple candidate solutions from a large space and evaluates the quality of each candidate through a long training process. As… ▽ More The field of Automatic Machine Learning (AutoML) has recently attained impressive results, including the discovery of state-of-the-art machine learning solutions, such as neural image classifiers. This is often done by applying an evolutionary search method, which samples multiple candidate solutions from a large space and evaluates the quality of each candidate through a long training process. As a result, the search tends to be slow. In this paper, we show that large efficiency gains can be obtained by employing a fast unified functional hash, especially through the functional equivalence caching technique, which we also present. The central idea is to detect by hashing when the search method produces equivalent candidates, which occurs very frequently, and this way avoid their costly re-evaluation. Our hash is "functional" in that it identifies equivalent candidates even if they were represented or coded differently, and it is "unified" in that the same algorithm can hash arbitrary representations; e.g. compute graphs, imperative code, or lambda functions. As evidence, we show dramatic improvements on multiple AutoML domains, including neural architecture search and algorithm discovery. Finally, we consider the effect of hash collisions, evaluation noise, and search distribution through empirical analysis. Altogether, we hope this paper may serve as a guide to hashing techniques in AutoML. △ Less

Submitted 10 February, 2023; originally announced February 2023.

ACM Class: I.2.2; I.2.6

arXiv:2302.02414 [pdf, ps, other]

Secure Codes with List Decoding

Authors: Yujie Gu, Ilya Vorobyev, Ying Miao

Abstract: In this paper we consider combinatorial secure codes in traitor tracing for protecting copyright of multimedia content. First, we introduce a new notion of secure codes with list decoding (SCLDs) for collusion-resistant multimedia fingerprinting, which includes many existing types of fingerprinting codes as special cases. Next, we build efficient identifying algorithms for SCLDs with complete trac… ▽ More In this paper we consider combinatorial secure codes in traitor tracing for protecting copyright of multimedia content. First, we introduce a new notion of secure codes with list decoding (SCLDs) for collusion-resistant multimedia fingerprinting, which includes many existing types of fingerprinting codes as special cases. Next, we build efficient identifying algorithms for SCLDs with complete traceability and establish bounds on its largest possible code rate. In comparison with the existing fingerprinting codes, it is shown that SCLDs have not only much more efficient traceability than separable codes but also a much larger code rate than frameproof codes. As a byproduct, new bounds on the largest code rate of binary separable codes are established as well. Furthermore, a two-stage dynamic traitor tracing framework is proposed for multimedia fingerprinting in the dynamic scenario, which could not only efficiently achieve the complete traceability but also provide a much larger capacity than the static scenario. △ Less

Submitted 5 February, 2023; originally announced February 2023.

Comments: 20 pages

arXiv:2301.08984 [pdf, other]

SuperScaler: Supporting Flexible DNN Parallelization via a Unified Abstraction

Authors: Zhiqi Lin, Youshan Miao, Guodong Liu, Xiaoxiang Shi, Quanlu Zhang, Fan Yang, Saeed Maleki, Yi Zhu, Xu Cao, Cheng Li, Mao Yang, Lintao Zhang, Lidong Zhou

Abstract: With the growing model size, deep neural networks (DNN) are increasingly trained over massive GPU accelerators, which demands a proper parallelization plan that transforms a DNN model into fine-grained tasks and then schedules them to GPUs for execution. Due to the large search space, the contemporary parallelization plan generators often rely on empirical rules that couple transformation and sche… ▽ More With the growing model size, deep neural networks (DNN) are increasingly trained over massive GPU accelerators, which demands a proper parallelization plan that transforms a DNN model into fine-grained tasks and then schedules them to GPUs for execution. Due to the large search space, the contemporary parallelization plan generators often rely on empirical rules that couple transformation and scheduling, and fall short in exploring more flexible schedules that yield better memory usage and compute efficiency. This tension can be exacerbated by the emerging models with increasing complexity in their structure and model size. SuperScaler is a system that facilitates the design and generation of highly flexible parallelization plans. It formulates the plan design and generation into three sequential phases explicitly: model transformation, space-time scheduling, and data dependency preserving. Such a principled approach decouples multiple seemingly intertwined factors and enables the composition of highly flexible parallelization plans. As a result, SuperScaler can not only generate empirical parallelization plans, but also construct new plans that achieve up to 3.5X speedup compared to state-of-the-art solutions like DeepSpeed, Megatron and Alpa, for emerging DNN models like Swin-Transformer and AlphaFold2, as well as well-optimized models like GPT-3. △ Less

Submitted 21 January, 2023; originally announced January 2023.

arXiv:2301.08359 [pdf, other]

Domain-adapted Learning and Interpretability: DRL for Gas Trading

Authors: Yuanrong Wang, Yinsen Miao, Alexander CY Wong, Nikita P Granger, Christian Michler

Abstract: Deep Reinforcement Learning (Deep RL) has been explored for a number of applications in finance and stock trading. In this paper, we present a practical implementation of Deep RL for trading natural gas futures contracts. The Sharpe Ratio obtained exceeds benchmarks given by trend following and mean reversion strategies as well as results reported in literature. Moreover, we propose a simple but e… ▽ More Deep Reinforcement Learning (Deep RL) has been explored for a number of applications in finance and stock trading. In this paper, we present a practical implementation of Deep RL for trading natural gas futures contracts. The Sharpe Ratio obtained exceeds benchmarks given by trend following and mean reversion strategies as well as results reported in literature. Moreover, we propose a simple but effective ensemble learning scheme for trading, which significantly improves performance through enhanced model stability and robustness as well as lower turnover and hence lower transaction cost. We discuss the resulting Deep RL strategy in terms of model explainability, trading frequency and risk measures. △ Less

Submitted 10 September, 2023; v1 submitted 19 January, 2023; originally announced January 2023.

arXiv:2301.03004 [pdf, other]

doi 10.1016/j.nuclphysb.2023.116280

Joule-Thomson effect of AdS black holes in conformal gravity

Authors: Yang Guo, Hao Xie, Yan-Gang Miao

Abstract: We investigate the Joule-Thomson effect of AdS black holes in conformal gravity. We derive the Joule-Thomson coefficient in terms of thermodynamic relations and then make an alternative derivation via a direct way. We analyze the Joule-Thomson coefficient and find that the Joule-Thomson coefficients obtained from two different approaches are equal. Moreover, we present a novel isenthalpic process… ▽ More We investigate the Joule-Thomson effect of AdS black holes in conformal gravity. We derive the Joule-Thomson coefficient in terms of thermodynamic relations and then make an alternative derivation via a direct way. We analyze the Joule-Thomson coefficient and find that the Joule-Thomson coefficients obtained from two different approaches are equal. Moreover, we present a novel isenthalpic process in which the inversion temperature is minimal and it separates the corresponding heating-cooling phase. We analyze the inversion temperature and its corresponding inversion curve that separates the regions for the JT effect to be allowable and forbidden, where such an effect can only be observed in the allowable region. We also discuss the effects of two important parameters on the inversion curves. △ Less

Submitted 26 June, 2023; v1 submitted 8 January, 2023; originally announced January 2023.

Comments: v1: 10 pages, 4 figures; v2: 11 pages, clarifications and references added, final version to appear in Nuclear Physics B

Journal ref: Nucl. Phys. B 993 (2023) 116280 (10 pages)

arXiv:2212.04460 [pdf]

doi 10.1103/PhysRevLett.131.026701

Tunable van Hove singularity without structural instability in Kagome metal CsTi$_3$Bi$_5$

Authors: Bo Liu, Minquan Kuang, Yang Luo, Yongkai Li, Linwei Huai, Shuting Peng, Zhiyuan Wei, Jianchang Shen, Bingqian Wang, Yu Miao, Xiupeng Sun, Zhipeng Ou, Yugui Yao, Zhiwei Wang, Junfeng He

Abstract: In Kagome metal CsV$_3$Sb$_5$, multiple intertwined orders are accompanied by both electronic and structural instabilities. These exotic orders have attracted much recent attention, but their origins remain elusive. The newly discovered CsTi$_3$Bi$_5$ is a Ti-based Kagome metal to parallel CsV$_3$Sb$_5$. Here, we report angle-resolved photoemission experiments and first-principles calculations on… ▽ More In Kagome metal CsV$_3$Sb$_5$, multiple intertwined orders are accompanied by both electronic and structural instabilities. These exotic orders have attracted much recent attention, but their origins remain elusive. The newly discovered CsTi$_3$Bi$_5$ is a Ti-based Kagome metal to parallel CsV$_3$Sb$_5$. Here, we report angle-resolved photoemission experiments and first-principles calculations on pristine and Cs-doped CsTi$_3$Bi$_5$ samples. Our results reveal that the van Hove singularity (vHS) in CsTi$_3$Bi$_5$ can be tuned in a large energy range without structural instability, different from that in CsV$_3$Sb$_5$. As such, CsTi$_3$Bi$_5$ provides a complementary platform to disentangle and investigate the electronic instability with a tunable vHS in Kagome metals. △ Less

Submitted 8 December, 2022; originally announced December 2022.

Journal ref: Phys. Rev. Lett. 131, 026701 (2023)

arXiv:2212.01723 [pdf, other]

doi 10.1016/j.physletb.2023.137884

On heat properties of charged AdS black holes in Gauss-Bonnet gravity coupled with nonlinear electrodynamics

Authors: Yang Guo, Yan-Gang Miao

Abstract: We investigate the heat properties of charged AdS black holes in the Gauss-Bonnet gravity coupled with nonlinear electrodynamics. We consider the thermodynamics of black holes from the perspective of heat capacity and show that the nonlinear electrodynamics can be helpful to improve the thermodynamic stability of black holes. We perform a two-dimensional description in order to reproduce the Hawki… ▽ More We investigate the heat properties of charged AdS black holes in the Gauss-Bonnet gravity coupled with nonlinear electrodynamics. We consider the thermodynamics of black holes from the perspective of heat capacity and show that the nonlinear electrodynamics can be helpful to improve the thermodynamic stability of black holes. We perform a two-dimensional description in order to reproduce the Hawking temperature, which confirms that the Hawking temperature has an intrinsic topological nature and holds for a higher dimensional spherically symmetric spacetime. We also analyze the Maxwell equal area law and coexistence curve, and find the existence of van der Waals-like phase transitions based on critical exponents. Moreover, we deal with a charged AdS black hole in the Gauss-Bonnet gravity coupled with nonlinear electrodynamics as a working material to study holographic heat engines and obtain an exact expression for the efficiency of a rectangular engine cycle. We then discuss the effects of nonlinear electrodynamics and Gauss-Bonnet couplings on the rectangular engine cycle and compare the efficiency of this cycle with that of the Carnot cycle. △ Less

Submitted 8 April, 2023; v1 submitted 3 December, 2022; originally announced December 2022.

Comments: v1: 11 pages, 5 figures; v2: 19 pages, 6 figures, clarifications and references added, final version to appear in Physics Letters B

Journal ref: Phys. Lett. B 840 (2023) 137884 (12 pages)

arXiv:2211.15130 [pdf, other]

doi 10.1088/1674-1137/accdc7

Superradiance of massive scalar particles around rotating regular black holes

Authors: Hao Yang, Yan-Gang Miao

Abstract: Regular black holes, as an important attempt to eliminate the singularities in general relativity, have been widely concerned. Due to the fact that the superradiance associated with rotating regular black holes plays an indispensable role in black hole physics, we calculate the superradiance related effects, i.e., the superradiance instability and the energy extraction efficiency for a scalar part… ▽ More Regular black holes, as an important attempt to eliminate the singularities in general relativity, have been widely concerned. Due to the fact that the superradiance associated with rotating regular black holes plays an indispensable role in black hole physics, we calculate the superradiance related effects, i.e., the superradiance instability and the energy extraction efficiency for a scalar particle with a small mass around a rotating regular black hole, where the rotating regular black hole is constructed by the modified Newman-Janis algorithm. We analytically give the eigenfrequency associated with instability and the amplification factor associated with energy extraction. For two specific models, the rotating Hayward and Bardeen black holes, we investigate how their regularization parameters affect the growth of instability and the efficiency of energy extraction from the two rotating regular black holes. We find that the regularization parameters give rise to different modes on the superradiance instability and the energy extraction when the rotation parameters are varying. There are two modes for the growth of superradiance instability, and four modes for the energy extraction. Our results show the diversity of superradiance in the competition between the regularization parameter and the rotation parameter for rotating regular black holes. △ Less

Submitted 24 May, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

Comments: v1: 32 pages, 15 figures, 1 appendix; v2: 36 pages, clarifications and one reference added, final version to appear in Chinese Physics C

Journal ref: Chinese Phys. C 47 (2023) 075101 (22 pages)

Showing 51–100 of 357 results for author: Miao, Y