Classifying Neuronal Cell Types Based on Shared Electrophysiological Information from Humans and Mice
Innovative Fusion in Neuron Classification: Shared Information from Human and Mouse Electrophysiological Data
The scientific community has long faced significant challenges in neuron classification. Accurate classification of neurons is crucial for understanding brain function in both healthy and diseased states. This study, led by Ofek Ophir, Orit Shefi, and Ofir Lindenbaum from Bar-Ilan University, published in the journal Neuroinformatics, proposes a novel machine learning framework that classifies neurons by jointly using electrophysiological data from humans and mice.
Research Background
Neurons are the basic units of the nervous system, and their classification has been a core issue in neuroscience since Ramon y Cajal published Histology of the Nervous System of Man and Vertebrates in 1995. Classifying neurons aids in consistent identification across different laboratories and experimental conditions, which is crucial for understanding brain function and its alterations in healthy and pathological states.
Research Origin
This study was conducted by the Faculty of Engineering and the Leslie & Susan Gonda Multidisciplinary Brain Research Center at Bar-Ilan University (Ramat-Gan, Israel). The article was accepted on June 10, 2024, and published in the journal Neuroinformatics, DOI link: https://doi.org/10.1007/s12021-024-09675-5.
Research Process
Data Sources
The research data primarily comes from the Allen Cell Types Database (ACTD), which contains biological data from single-cell recordings of adult mice and humans. The mouse data includes recordings from 1920 cells, while the human data includes recordings from 413 cells.
Data Preprocessing
Through the extraction of electrophysiological features, the research team analyzed 41 electrophysiological tabular features. Data records included four types of stimulation conditions: noise stimulus, ramp stimulus, rectangular stimulus, and short rectangular stimulus, each designed to elicit different types of action potential (AP) responses.
Research Tasks
The research was divided into two main tasks:
- Classification of broad neuron types (excitatory and inhibitory) in humans and mice
- Classification of neuron subtypes using electrophysiological data from mice
Classification Models
The study employed two neural network models: - Domain-Adaptive Neural Network (DANN) - Locally Sparse Interpretable Network (LSPIN)
Domain-Adaptive Classification with DANN Model
To address the scarcity of human data samples, the research team used the DANN model to embed shared information from mouse data into human data. By aligning the distributions of the two domains, cross-domain classification was achieved.
Multi-Label Classification with LSPIN Model
Given the richness of mouse samples, the research team used the LSPIN model to handle the classification of five subtypes. This approach overcame the overfitting problem associated with low sample size data and achieved interpretable classification by predicting the most informative features for each sample.
Research Results
Task One: Broad Neuron Type Classification in Humans and Mice
Using the DANN model, the study demonstrated that whole-cell current clamp recordings in the mouse brain are similar to those in the human brain. The model showed high accuracy in classifying neuron types in both humans and mice, achieving 95.0% accuracy for human samples and 97.4% accuracy for mouse samples.
Task Two: Mouse Neuron Subtype Classification
Using the LSPIN model, the study achieved a rare high accuracy of 91.6% in classifying five subtypes, outperforming traditional machine learning models such as Random Forest (RF), Support Vector Classifier (SVC), and XGBoost. Additionally, the model provided interpretability in feature selection for each subtype.
Research Significance
Scientific Value
This study utilized a cross-species data fusion approach to improve the accuracy of neuron classification, contributing to a deeper understanding of the electrophysiological characteristics of neurons. This method can be used in real-time clinical applications, providing a basis for early diagnosis and treatment planning.
Practical Value
The DANN model effectively addresses the domain transfer issue in neuron classification, enabling the model to generalize across different organisms while maintaining high accuracy. The LSPIN model reduces overfitting through feature selection, enhancing classification interpretability, which is crucial for clinical applications.
Research Highlights
- Cross-Domain Classification: By integrating data from humans and mice, the study addresses data scarcity and domain transfer issues.
- Model Interpretability: The LSPIN model not only provides highly accurate classification results but also reveals the importance of features, offering new insights for biological feature research.
Future Research Directions
Future research could consider: 1. Expanding the range of species classified to explore the evolutionary conservation of different neuron types. 2. Applying the DANN method in an unsupervised manner to further improve classification accuracy for human neuron data. 3. Evaluating the model’s generalization capability and verifying its applicability across different laboratories and experimental conditions. 4. Deeply analyzing the biological characteristics of each neuron type to increase the interpretability of model predictions.
Conclusion
This study provides new insights into neuron classification methods by demonstrating the importance of cross-species data fusion and feature selection through the DANN and LSPIN models. The results not only enhance scientific understanding but also offer reliable tools for practical applications.