Multilevel Ensemble Membership Inference Attack
In-depth Analysis of the Research Paper: MEMIA: Multilevel Ensemble Membership Inference Attack
Introduction to the Research Background
With the rapid development of digital technologies, artificial intelligence (AI) and machine learning (ML) have deeply permeated multiple domains, including healthcare, finance, retail, education, and social media. However, alongside the widespread application of these technologies, the risk of privacy leakage has also risen significantly. Numerous studies have demonstrated that machine learning models are vulnerable to adversarial threats, among which a critical form of privacy attack is the Membership Inference Attack (MIA). The core objective of an MIA is to infer whether a specific data sample was part of the model's training set by analyzing the target model's output distribution. Nevertheless, existing MIA methods face many limitations, especially in scenarios where datasets have fewer classes or target models are underfitted, leading to significantly reduced attack accuracy. Consequently, improving the performance of MIA methods has become a pressing challenge.
The authors of this paper, Najeeb Ullah, Muhammad Naveed Aman, and Biplab Sikdar, delved deeply into this issue and proposed a novel multilevel ensemble learning framework, MEMIA (Multilevel Ensemble Membership Inference Attack). This framework leverages the characteristics of Neural Networks (NN) and Long Short-Term Memory (LSTM) networks. By utilizing embedding techniques, it captures distribution and sequential order differences between member and non-member samples, thereby significantly enhancing attack performance.
Paper Source and Author Background
This paper was published in the IEEE Transactions on Artificial Intelligence journal (Volume 6, Issue 1, January 2025), and it is an original research piece with high academic value in the field of privacy protection. The first author, Najeeb Ullah, and co-author, Biplab Sikdar, are affiliated with the Department of Electrical and Computer Engineering at the National University of Singapore. Meanwhile, Muhammad Naveed Aman is a researcher at the School of Computing, University of Nebraska-Lincoln, USA. The research was supported by funding from institutions including the Asian Institute of Digital Finance.
Detailed Research Workflow
The focus of MEMIA research is to enhance the performance of MIA methods using a stacked ensemble learning approach. The research is structured into several steps, each involving specific experimental subjects, procedural descriptions, and details of algorithm implementation.
1. Dataset Selection and Preprocessing
The study selected seven benchmark datasets to evaluate the applicability of the method. All image-based datasets were resized to 32×32 pixels to ensure feature consistency. The datasets are:

- FMNIST (Fashion-MNIST): 70,000 grayscale images of fashion items in 10 classes.
- UTKFace: 22,012 facial images annotated with age, race, and gender labels.
- Location: a geosocial dataset from a social network, where each record consists of 446 binary features indicating a user's location visitation history.
- Purchase-100: a shopping-behavior dataset with 100 classes, each sample represented by 600 binary features.
- CIFAR-10/100 and STL-10: standard image classification datasets.
Each dataset was split into two disjoint parts, one for training the target model and one for training the shadow model; this complete separation between the two subsets increases the difficulty of the attack.
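The disjoint split described above can be sketched as follows. This is a minimal illustration, not the paper's code: the helper name and the use of FMNIST's size (70,000) as an example are assumptions.

```python
import numpy as np

def disjoint_split(n_samples, seed=0):
    """Shuffle sample indices and split them into two disjoint halves:
    one half trains the target model, the other trains the shadow model.
    (Hypothetical helper, not taken from the paper's implementation.)"""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(n_samples)
    half = n_samples // 2
    return idx[:half], idx[half:]

target_idx, shadow_idx = disjoint_split(70_000)   # e.g. FMNIST's size
assert set(target_idx).isdisjoint(shadow_idx)     # complete separation
```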
2. Target Model Training
Different datasets employed distinct target model architectures:

- Image datasets used a Convolutional Neural Network (CNN) consisting of three convolutional layers and two fully connected layers.
- The Location and Purchase-100 datasets used two-layer and six-layer Fully Connected Neural Networks (FCNN), respectively.
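To make the FCNN case concrete, here is an illustrative forward pass for a two-layer fully connected network on Location-style input (446 binary features). The hidden width (128) and the number of output classes (30) are assumptions for this sketch, not values reported in the paper.

```python
import numpy as np

rng = np.random.default_rng(1)

# Randomly initialized weights for a two-layer FCNN; the dimensions
# 446 -> 128 -> 30 are illustrative, not the paper's configuration.
W1 = rng.normal(scale=0.05, size=(446, 128))
W2 = rng.normal(scale=0.05, size=(128, 30))

def forward(x):
    h = np.maximum(0.0, x @ W1)                   # ReLU hidden layer
    z = h @ W2                                    # class logits
    e = np.exp(z - z.max(axis=1, keepdims=True))  # stable softmax
    return e / e.sum(axis=1, keepdims=True)

x = rng.integers(0, 2, size=(4, 446)).astype(float)  # 4 binary records
probs = forward(x)   # each row is a probability distribution over classes
```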
The experiments trained the target models with the Stochastic Gradient Descent (SGD) optimizer and a cross-entropy loss function. The training and testing accuracies of the target models are summarized in the table below:
| Dataset | Training Accuracy | Testing Accuracy |
|---|---|---|
| FMNIST | 99.9% | 90.0% |
| UTKFace | 99.9% | 83.2% |
| STL-10 | 99.9% | 54.8% |
| CIFAR-10 | 99.8% | 60.2% |
| CIFAR-100 | 99.7% | 27.2% |
| Location | 98.9% | 68.7% |
| Purchase-100 | 99.9% | 66.0% |
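The train/test gaps implied by the table can be computed directly; a larger generalization gap generally makes membership inference easier. The snippet below just re-derives the gaps (in percentage points) from the reported accuracies.

```python
# (train accuracy, test accuracy) pairs from the table above, in percent.
acc = {
    "FMNIST": (99.9, 90.0), "UTKFace": (99.9, 83.2),
    "STL-10": (99.9, 54.8), "CIFAR-10": (99.8, 60.2),
    "CIFAR-100": (99.7, 27.2), "Location": (98.9, 68.7),
    "Purchase-100": (99.9, 66.0),
}

# Generalization gap in percentage points for each target model.
gaps = {name: round(train - test, 1) for name, (train, test) in acc.items()}

widest = max(gaps, key=gaps.get)  # CIFAR-100, with a 72.5-point gap
```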
3. MEMIA Attack Model Design
MEMIA adopts a stacked ensemble learning architecture consisting of two base models and one meta-model:

1. Base Neural Network (Base NN): captures distribution differences between member and non-member samples.
2. Base Long Short-Term Memory network (Base LSTM): handles the sequential information in prediction vectors.
3. Meta Neural Network (Meta-Model): combines the embeddings from the two base models to learn richer features.
During training, all models used the Adam optimizer. The Base NN had a four-layer fully connected structure, while the Base LSTM comprised a three-layer architecture. The Meta-Model integrated the outputs of the two base models.
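The data flow through the stack can be sketched as below. The linear "base models" are deliberate stand-ins for the paper's four-layer NN and three-layer LSTM, and all dimensions are illustrative assumptions; only the wiring (two embeddings, concatenated, fed to a meta-classifier) reflects the described architecture.

```python
import numpy as np

rng = np.random.default_rng(2)
n_classes, d_embed = 10, 16   # illustrative sizes, not from the paper

W_nn = rng.normal(scale=0.1, size=(n_classes, d_embed))    # base-NN stand-in
W_lstm = rng.normal(scale=0.1, size=(n_classes, d_embed))  # base-LSTM stand-in
w_meta = rng.normal(scale=0.1, size=(2 * d_embed,))        # meta-model weights

def memia_score(pred_vec):
    """Map a target model's prediction vector to a membership probability."""
    e_nn = np.tanh(pred_vec @ W_nn)                # embedding from base NN
    e_lstm = np.tanh(np.sort(pred_vec) @ W_lstm)   # embedding from base LSTM
    z = np.concatenate([e_nn, e_lstm]) @ w_meta    # meta-model on both
    return 1.0 / (1.0 + np.exp(-z))                # sigmoid -> probability

pred = rng.dirichlet(np.ones(n_classes))  # a softmax-like prediction vector
score = memia_score(pred)                 # membership probability in (0, 1)
```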
Key Research Results
1. Attack Performance Analysis
The results demonstrate that MEMIA significantly outperforms state-of-the-art MIA methods, with particularly strong performance on complex datasets such as CIFAR-100, where it achieves an attack accuracy of 94.6%, compared to 90.6% for the best previous MIA method.
2. Relationship Between Data Complexity and Attack Capability
Experiments reveal that MEMIA's attack capability improves as the dataset's class diversity and sample complexity increase:

- On simpler datasets such as FMNIST, MEMIA improved attack accuracy by approximately 3%.
- On more complex datasets such as CIFAR-10 and STL-10, attack accuracy increased by over 6% and 8%, respectively.
3. Precision and Coverage
MEMIA excelled in both precision and recall. An analysis of the cumulative distribution function (CDF) of attack outputs shows that MEMIA distinguishes member from non-member samples more accurately, with fewer false positives than traditional MIA methods, while maintaining higher coverage.
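For reference, precision and recall for a membership-inference attack are computed from binary member/non-member predictions as below; the labels and predictions here are toy values, not results from the paper.

```python
def precision_recall(y_true, y_pred):
    """Precision and recall for binary membership predictions
    (1 = member, 0 = non-member)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall

y_true = [1, 1, 1, 0, 0, 0]   # ground-truth membership (toy data)
y_pred = [1, 1, 0, 1, 0, 0]   # attack's guesses (toy data)
p, r = precision_recall(y_true, y_pred)   # p = 2/3, r = 2/3
```

Fewer false positives raise precision; fewer missed members raise recall (coverage).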
Scientific Significance and Practical Value
- Scientific Contribution: MEMIA’s multilevel learning framework substantially enhances MIA performance, providing an important benchmark for privacy protection research. The results underscore the value of multilevel feature extraction.
- Practical Relevance: MEMIA poses new challenges to privacy protection research while offering directions for developing more robust privacy defense mechanisms.
- Innovative Highlights:
- For the first time, it integrates multilevel model embeddings to significantly enhance the inference abilities of attack models.
- It provides a generalizable approach independent of the number of data classes, making it suitable for black-box attack scenarios.
Future Research Directions
In the future, the authors propose exploring more advanced architectures, such as Transformer-based models, which exploit attention mechanisms to further enhance MIA performance. Additionally, strengthening MEMIA against potential defenses, such as incorporating gradient masking techniques in training shadow models, will also become a key research focus.
The research on MEMIA not only challenges existing privacy protection techniques but also lays a solid foundation for optimizing the privacy security of machine learning algorithms.