Singh R, Ling J, Doshi-Velez F. Structured Variational Autoencoders for the Beta-Bernoulli Process. Neural Information Processing Systems (NIPS) Workshop on Advances in Approximate Bayesian Inference. 2017. Paper
Tan S, Doshi-Velez F, Quiroz J, Glassman E. Clustering LaTeX Solutions to Machine Learning Assignments for Rapid Assessment. Advancing Education with Data Knowledge Discovery and Data Mining (KDD) Workshop. 2017. Paper
Depeweg S, Hernandez-Lobato JM, Doshi-Velez F, Udluft S. Uncertainty Decomposition in Bayesian Neural Networks with Latent Variables. International Conference on Machine Learning (ICML) Workshop. 2017. Paper
Parbhoo S, Bogojeska J, Zazzi M, Roth V, Doshi-Velez F. Combining Kernel and Model Based Learning for HIV Therapy Selection, in AMIA Summits on Translational Science Proceedings . Vol 2017. ; 2017 :239. Paper
Fan A, Doshi-Velez F, Miratrix L. Prior Matters: Simple and General Methods for Evaluating and Improving Topic Quality in Topic Modeling. Text as Data. 2017. Paper
Killian T, Daulton S, Konidaris G, Doshi-Velez F. Robust and Efficient Transfer Learning with Hidden Parameter Markov Decision Processes. Neural Information Processing Systems (NIPS). 2017. Paper
Wang T, Rudin C, Doshi-Velez F, Liu Y, Klampfl E, MacNeille P. A Bayesian Framework for Learning Rule Sets for Interpretable Classification. Journal of Machine Learning. 2017;18 (70) :1-37. Paper
Ross AS, Hughes MC, Doshi-Velez F. Right for the Right Reasons: Training Differentiable Models by Constraining their Explananations. International Joint Conference on Artificial Intelligence (IJCAI). 2017. Paper
Doshi-Velez F, Williamson S. Restricted Indian Buffet Processes. Statistics and Computing. 2017;27 (5) :1205-1223. Paper
Wu M, Ghassemi M, Fend M, Celi LA, Szolovits P, Doshi-Velez F. Understanding Vasopressor Intervention and Weaning: Risk Prediction in a Public Heterogeneous Clinical Time Series Database. Journal of the American Medical Informatics Association. 2017;24 (3) :488-495.Abstract


Background The widespread adoption of electronic health records allows us to ask evidence-based questions about the need for and benefits of specific clinical interventions in critical-care settings across large populations.

Objective We investigated the prediction of vasopressor administration and weaning in the intensive care unit. Vasopressors are commonly used to control hypotension, and changes in timing and dosage can have a large impact on patient outcomes.

Materials and Methods We considered a cohort of 15 695 intensive care unit patients without orders for reduced care who were alive 30 days post-discharge. A switching-state autoregressive model (SSAM) was trained to predict the multidimensional physiological time series of patients before, during, and after vasopressor administration. The latent states from the SSAM were used as predictors of vasopressor administration and weaning.

Results The unsupervised SSAM features were able to predict patient vasopressor administration and successful patient weaning. Features derived from the SSAM achieved areas under the receiver operating curve of 0.92, 0.88, and 0.71 for predicting ungapped vasopressor administration, gapped vasopressor administration, and vasopressor weaning, respectively. We also demonstrated many cases where our model predicted weaning well in advance of a successful wean.

Conclusion Models that used SSAM features increased performance on both predictive tasks. These improvements may reflect an underlying, and ultimately predictive, latent state detectable from the physiological time series.


Depewag S, Hernández-Lobato JM, Doshi-Velez F, Udluft S. Learning and Policy Search in Stochastic Dynamical Systems with Bayesian Neural Networks. ICLR. 2017.Abstract

We present an algorithm for model-based reinforcement learning that combines Bayesian neural networks (BNNs) with random roll-outs and stochastic optimization for policy learning. The BNNs are trained by minimizing α  -divergences, allowing us to capture complicated statistical patterns in the transition dynamics, e.g. multi-modality and heteroskedasticity, which are usually missed by other common modeling approaches. We illustrate the performance of our method by solving a challenging benchmark where model-based approaches usually fail and by obtaining promising results in a real-world scenario for controlling a gas turbine.

Parbhoo S, Bogojeska J, Zazzi M, Roth V, Doshi-Velez F. Combining Kernel and Model Based Learning for HIV Therapy Selection. Neural Information Processing Systems (NIPS) Workshop for Machine Learning and Healthcare. 2016. Paper
Killian TW, Konidaris G, Doshi-Velez F. Transfer Learning Across Patient Variations with Hidden Parameter Markov Decision Processes. Neural Information Processing Systems (NIPS) Workshop for Machine Learning and Healthcare. 2016. Paper
Hughes MC, Elibol HM, McCoy T, Perlis R, Doshi-Velez F. Supervised topic models for clinical interpretability. Neural Information Processing Systems (NIPS) Workshop for Machine Learning and Healthcare. 2016. Paper
Masood MA, Doshi-Velez F. Robust Posterior Exploration in NMF. International Conference on Machine Learning (ICML) Workshop on Geometry in Machine Learning. 2016. Paper
Shain C, Bryce W, Jin L, Krakovna V, Doshi-Velez F, Miller T, Schuler W, Schwartz L. Memory-Bounded Left-Corner Unsupervised Grammar Induction on Child-Directed Input. Computational Linguistics: Technical Papers (COLING). 2016 :964-975. Paper
Doshi-Velez F, Konidaris G. Hidden Parameter Markov Decision Processes: A Semiparametric Regression Approach for Discovering Latent Task Parametrizations. IJCAI. 2016. Paper
Elibol M, Nguyen V, Linderman S, Johnson M, Hashmi A, Doshi-Velez F. Cross-Corpora Unsupervised Learning of Trajectories in Autism Spectrum Disorders. Journal of Machine Learning Research. 2016;17 (1) :4597-4634. Paper
Tran D, Kim M, Doshi-Velz F. Spectral M-estimation with Application to Hidden Markov Models: Supplementary Material. AISTATS. 2016. Paper
Pan W, Doshi-Velez F. A Characterization of the Non-Uniqueness of Nonnegative Matrix Factorizations. arXiv:1604.00653 . 2016.Abstract

Nonnegative matrix factorization (NMF) is a popular dimension reduction technique that produces interpretable decomposition of the data into parts. However, this decompostion is not generally identifiable (even up to permutation and scaling). While other studies have provide criteria under which NMF is identifiable, we present the first (to our knowledge) characterization of the non-identifiability of NMF. We describe exactly when and how non-uniqueness can occur, which has important implications for algorithms to efficiently discover alternate solutions, if they exist.