Anru Zhang

Eugene Anson Stead, Jr. M.D. Associate Professor
Biostatistics & Bioinformatics, Division of Translational Biomedical

Selected Publications


Soft phenotyping for sepsis via EHR time-aware soft clustering.

Journal Article J Biomed Inform · February 27, 2024 OBJECTIVE: Sepsis is one of the most serious hospital conditions associated with high mortality. Sepsis is the result of a dysregulated immune response to infection that can lead to multiple organ dysfunction and death. Due to the wide variability in the c ...

Cocaine Use Prediction With Tensor-Based Machine Learning on Multimodal MRI Connectome Data.

Journal Article Neural Comput · December 12, 2023 This letter considers the use of machine learning algorithms for predicting cocaine use based on magnetic resonance imaging (MRI) connectomic data. The study used functional MRI (fMRI) and diffusion MRI (dMRI) data collected from 275 individuals, which was ...

Longitudinal changes in neurocognitive performance related to drug use intensity in a sample of persons with and without HIV who use illicit stimulants.

Journal Article Drug Alcohol Depend · October 1, 2023 BACKGROUND: Illicit stimulant use remains a public health concern that has been associated with multiple adverse outcomes, including cognitive deficits. The effects of stimulant use on cognition may be particularly deleterious in persons with HIV. Stimulan ...

Rungang Han and Anru R. Zhang's contribution to the Discussion of ‘Vintage factor analysis with varimax performs statistical inference’ by Rohe & Zeng

Journal Article Journal of the Royal Statistical Society. Series B: Statistical Methodology · September 1, 2023 In the 1930s, psychologists began developing Multiple-Factor Analysis to decompose multivariate data into a small number of interpretable factors without any a priori knowledge about those factors. In this form of factor analysis, the Varimax factor rotati ...

Sparse and Low-Rank Tensor Estimation via Cubic Sketchings

Journal Article IEEE Transactions on Information Theory · September 2020

Spectral State Compression of Markov Processes

Journal Article IEEE Transactions on Information Theory · May 2020

Multisample estimation of bacterial composition matrices in metagenomics data

Journal Article Biometrika · March 1, 2020 Summary: Metagenomics sequencing is routinely applied to quantify bacterial abundances in microbiome studies, where bacterial composition is estimated based on the sequencing read counts. Due to limited sequen ...

ISLET: Fast and Optimal Low-Rank Tensor Regression via Importance Sketching

Journal Article SIAM Journal on Mathematics of Data Science · January 2020

On the non-asymptotic and sharp lower tail bounds of random variables

Journal Article Stat · January 1, 2020 The non-asymptotic tail bounds of random variables play crucial roles in probability, statistics, and machine learning. Despite much success in developing upper bounds on tail probabilities in literature, the lower bounds on tail probabilities are relative ...

LTMG: a novel statistical modeling of transcriptional expression states in single-cell RNA-Seq data

Journal Article Nucleic Acids Research · October 10, 2019 Abstract: A key challenge in modeling single-cell RNA-seq data is to capture the diversity of gene expression states regulated by different transcriptional regulatory inputs across individual cells, which is further complicat ...

Optimal Sparse Singular Value Decomposition for High-Dimensional High-Order Data

Journal Article Journal of the American Statistical Association · October 2, 2019

Semi-supervised inference: General theory and estimation of means

Journal Article The Annals of Statistics · October 1, 2019

Cross: Efficient low-rank tensor completion

Journal Article The Annals of Statistics · April 1, 2019

Tensor SVD: Statistical and Computational Limits

Journal Article IEEE Transactions on Information Theory · November 2018

Sequential rerandomization

Journal Article Biometrika · September 1, 2018

Regression analysis for microbiome compositional data

Journal Article The Annals of Applied Statistics · June 1, 2016

Structured Matrix Completion with Applications to Genomic Data Integration

Journal Article Journal of the American Statistical Association · April 2, 2016

Instrumental Variables Estimation With Some Invalid Instruments and its Application to Mendelian Randomization

Journal Article Journal of the American Statistical Association · January 2, 2016

Inference for high-dimensional differential correlation matrices

Journal Article Journal of Multivariate Analysis · January 2016

ROP: Matrix recovery via rank-one projections

Journal Article The Annals of Statistics · February 1, 2015

Sparse Representation of a Polytope and Recovery of Sparse Signals and Low-Rank Matrices

Journal Article IEEE Transactions on Information Theory · January 2014

Sharp RIP bound for sparse signal and low-rank matrix recovery

Journal Article Applied and Computational Harmonic Analysis · July 2013

Compressed Sensing and Affine Rank Minimization Under Restricted Isometry

Journal Article IEEE Transactions on Signal Processing · July 2013