NeuroCrypt: Machine Learning Over Encrypted Distributed Neuroimaging Data

5Citations
Citations of this article
20Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The field of neuroimaging can greatly benefit from building machine learning models to detect and predict diseases, and discover novel biomarkers, but much of the data collected at various organizations and research centers is unable to be shared due to privacy or regulatory concerns (especially for clinical data or rare disorders). In addition, aggregating data across multiple large studies results in a huge amount of duplicated technical debt and the resources required can be challenging or impossible for an individual site to build. Training on the data distributed across organizations can result in models that generalize much better than models trained on data from any of organizations alone. While there are approaches for decentralized sharing, these often do not provide the highest possible guarantees of sample privacy that only cryptography can provide. In addition, such approaches are often focused on probabilistic solutions. In this paper, we propose an approach that leverages the potential of datasets spread among a number of data collecting organizations by performing joint analyses in a secure and deterministic manner when only encrypted data is shared and manipulated. The approach is based on secure multiparty computation which refers to cryptographic protocols that enable distributed computation of a function over distributed inputs without revealing additional information about the inputs. It enables multiple organizations to train machine learning models on their joint data and apply the trained models to encrypted data without revealing their sensitive data to the other parties. In our proposed approach, organizations (or sites) securely collaborate to build a machine learning model as it would have been trained on the aggregated data of all the organizations combined. Importantly, the approach does not require a trusted party (i.e. aggregator), each contributing site plays an equal role in the process, and no site can learn individual data of any other site. We demonstrate effectiveness of the proposed approach, in a range of empirical evaluations using different machine learning algorithms including logistic regression and convolutional neural network models on human structural and functional magnetic resonance imaging datasets.

References Powered by Scopus

ImageNet: A Large-Scale Hierarchical Image Database

52485Citations
N/AReaders
Get full text

Learning representations by back-propagating errors

21092Citations
N/AReaders
Get full text

A survey on deep learning in medical image analysis

9760Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Federated Analysis of Neuroimaging Data: A Review of the Field

16Citations
N/AReaders
Get full text

Decision-making Support System for Predicting and Eliminating Malnutrition and Anemia

6Citations
N/AReaders
Get full text

Enhancing collaborative neuroimaging research: introducing COINSTAC Vaults for federated analysis and reproducibility

2Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Senanayake, N., Podschwadt, R., Takabi, D., Calhoun, V. D., & Plis, S. M. (2022). NeuroCrypt: Machine Learning Over Encrypted Distributed Neuroimaging Data. Neuroinformatics, 20(1), 91–108. https://doi.org/10.1007/s12021-021-09525-8

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 5

83%

Researcher 1

17%

Readers' Discipline

Tooltip

Engineering 3

43%

Neuroscience 2

29%

Nursing and Health Professions 1

14%

Computer Science 1

14%

Save time finding and organizing research with Mendeley

Sign up for free