A curated list of advancements in Vertical Federated Learning (VFL), covering publications, datasets, frameworks, and libraries.
- Publications in Top-tier Conferences
- VFL Datasets (or datasets that are used in benchmarks)
- Frameworks and Libraries with VFL support
Type | Title | Year | Conference / Journal | Description |
---|---|---|---|---|
VFL | Vertical Federated Learning: Concepts, Advances and Challenges | 2023 | arXiv | |
General | Towards Open Federated Learning Platforms: Survey and Vision from Technical and Legal Perspectives | 2024 | arXiv | |
Bench Type | Title | Year | Conference | Code | Algorithms |
---|---|---|---|---|---|
VFL | Stalactite: Toolbox for Fast Prototyping of Vertical Federated Learning Systems | 2024 | RecSys | Code | --- |
VFL | VertiBench: Advancing Feature Distribution Diversity in Vertical Federated Learning Benchmarks | 2024 | ICLR | Code, Website | GAL, C-VFL, SecureBoost, Pivot, FedTree, FedOnce |
VFL | VFLAIR: A Research Library and Benchmark for Vertical Federated Learning | 2024 | ICLR | Code | --- |
VFL | FedAds: A Benchmark for Privacy-Preserving CVR Estimation with Vertical Federated Learning | 2023 | SIGIR | Code | --- |
General | The OARF Benchmark Suite: Characterization and Implications for Federated Learning Systems | 2022 | ACM Transactions on Intelligent Systems and Technology | Code | --- |
General | FedML: A Research Library and Benchmark for Federated Machine Learning | 2020 | arXiv | Code | --- |
Title | Year | Conference / Journal | Description |
---|---|---|---|
Privacy Matters: Vertical Federated Linear Contextual Bandits for Privacy Protected Recommendation | 2023 | KDD | --- |
A Unified Solution for Privacy and Communication Efficiency in Vertical Federated Learning | 2023 | NeurIPS | --- |
Differentially Private Vertical Federated Clustering | 2023 | VLDB | --- |
Title | Year | Conference / Journal | Description |
---|---|---|---|
Fair and Efficient Contribution Valuation for Vertical Federated Learning | 2024 | ICLR | Clients' contribution valuation metric - vertical federated Shapley value (VerFedSV) |
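
The Shapley value behind metrics such as VerFedSV can be illustrated with a minimal sketch. This is the generic, exact Shapley computation over a toy coalition utility, not the VerFedSV algorithm from the paper; the party names, the `toy_utility` function, and its numbers are hypothetical and exist only for illustration.

```python
from itertools import combinations
from math import factorial


def shapley_values(parties, utility):
    """Exact Shapley value of each party for a coalition utility function."""
    n = len(parties)
    values = {}
    for p in parties:
        others = [q for q in parties if q != p]
        total = 0.0
        for k in range(n):
            # Weight of a size-k coalition that p joins: k! (n-k-1)! / n!
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for coalition in combinations(others, k):
                s = frozenset(coalition)
                # Marginal gain from adding p to the coalition s
                total += weight * (utility(s | {p}) - utility(s))
        values[p] = total
    return values


# Hypothetical utility table: model quality reached by each coalition of parties.
TOY_UTILITY = {
    frozenset(): 0.0,
    frozenset({"A"}): 0.5,
    frozenset({"B"}): 0.25,
    frozenset({"A", "B"}): 1.0,
}


def toy_utility(coalition):
    return TOY_UTILITY[frozenset(coalition)]


values = shapley_values(["A", "B"], toy_utility)
print(values)  # {'A': 0.625, 'B': 0.375}
```

The values sum to the utility of the full coalition (1.0 here). Exact enumeration visits every coalition, so it is only practical for a handful of parties.
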
Type | Dataset | Modality | Link | Benchmark | # parties | # samples | # features | # classes |
---|---|---|---|---|---|---|---|---|
VFL-native | NUS-WIDE | Image | Link | VertiBench, VFLAIR | 5 | 269,648 | 64 / 144 / 73 / 128 / 225 | 2 |
VFL-native | Satellite | Image | Link | VertiBench | 16 | 3,927 | 13-channel 158x158 | 4 |
VFL-native | Vehicle | Acoustic, Seismic | Link | VertiBench | 2 | 78,823 | 50 / 50 | 3 |
VFL-native | FedAds | Table | Link | FedAds | 2 | 11,300,000 | 16 / 7 | - |
Centralized | covtype | Table | Link | VertiBench | - | 581,012 | 54 | 7 |
Centralized | msd | Table | Link | VertiBench | - | 463,715 | 90 | - |
Centralized | realsim | Table | Link | VertiBench | - | 72,309 | 20,958 | 2 |
Centralized | gisette | Table | Link | VertiBench | - | 60,000 | 5,000 | 2 |
Centralized | epsilon | Table | Link | VertiBench, FedAds | - | 400,000 | 2,000 | 2 |
Centralized | letter | Table | Link | VertiBench | - | 15,000 | 16 | 26 |
Centralized | radar | Table | Link | VertiBench | - | 325,834 | 174 | 7 |
Centralized | MNIST | Image | Link | VertiBench, VFLAIR | - | 70,000 | 784 | 10 |
Centralized | CIFAR10 | Image | Link | VertiBench, VFLAIR | - | 60,000 | 1,024 | 10 |
Centralized | CIFAR100 | Image | Link | VFLAIR | - | 60,000 | 1,024 | 100 |
Centralized | Breast Cancer | Table | Link | VFLAIR | - | 569 | 32 | 2 |
Centralized | Pima Indians Diabetes | Table | Link | VFLAIR | - | 768 | 9 | 2 |
Centralized | Breast histopathology images | Image | Link | FedAds | | | | |
Centralized | Yahoo answers dataset | Text | Link | FedAds | | | | |
Centralized | Give Me Some Credit | Table | Link | FedAds | | | | |
Centralized | Avazu | Table | Link | FedAds | - | 45,006,432 | 23 | 2 |
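
Datasets marked Centralized have no natural party assignment, so a VFL experiment first has to divide their feature columns among parties while keeping the rows (samples) aligned. Below is a minimal sketch of the simplest such split, assuming consecutive column blocks; the helper name `vertical_split` and the toy data are hypothetical, and benchmarks may use more elaborate split strategies.

```python
def vertical_split(rows, feature_counts):
    """Split each row's features into consecutive per-party blocks.

    rows: list of equal-length feature lists, one per sample.
    feature_counts: number of features assigned to each party, in order.
    Returns one table per party; row i of every table is the same sample.
    """
    n_features = len(rows[0])
    if sum(feature_counts) != n_features:
        raise ValueError("feature_counts must sum to the number of features")
    parts = []
    start = 0
    for count in feature_counts:
        # Each party keeps all samples but only its own slice of columns
        parts.append([row[start:start + count] for row in rows])
        start += count
    return parts


# Toy data: 3 samples with 5 features, split between 2 hypothetical parties.
rows = [[r * 10 + c for c in range(5)] for r in range(3)]
party_a, party_b = vertical_split(rows, [2, 3])
print(party_a)  # [[0, 1], [10, 11], [20, 21]]
print(party_b)  # [[2, 3, 4], [12, 13, 14], [22, 23, 24]]
```

The per-party feature counts listed for VFL-native datasets (for example `50 / 50` for Vehicle) play the role of `feature_counts` here.
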
VFL-related (hetero in FATE terminology) features:
- privacy-preserving strategies: SSHE and FedPass
Implements a range of practical algorithms for horizontal and vertical FL settings.