Blockchain-based Data Quality Assessment to Improve Distributed Machine Learning


Yao Du, Zehua (David) Wang, Victor Leung, Cyril Leung

Date of Publication


Data quality assessment is critical for distributed machine learning (DML). Data collected from heterogeneous Internet of things (IoT) devices may contain biased information that decreases the prediction accuracy of DML models. To address these challenges, we propose a blockchain-based approach to assess the quality of data that are not independent and identically distributed (non-IID). A blockchain running atop mobile edge computing (MEC) is helpful to protect privacy, security, and integrity of healthcare data when IoT devices are connected to MEC servers. Therefore, it is critical to integrate data quality assessment module on blockchain when building a blockchain-enabled DML system. In this paper, we jointly consider information loss and marginal utility of non-IID data samples. Specifically, we use Kullback-Leibler (KL) divergence to evaluate the information loss between IID and non-IID data samples and apply the reciprocal of data quantity to model the marginal utility of data samples. Human activities and handwritten digit recognition data sets are used for performance evaluations. Experiments show that our proposed scheme outperforms benchmarks regarding model test accuracy on various non-IID data samples.

External Link

Read the Research Paper

  • Conference Paper

First Nations land acknowledegement

We acknowledge that the UBC Point Grey campus is situated on the traditional, ancestral, and unceded territory of the xʷməθkʷəy̓əm.

UBC Crest The official logo of the University of British Columbia. Urgent Message An exclamation mark in a speech bubble. Caret An arrowhead indicating direction. Arrow An arrow indicating direction. Arrow in Circle An arrow indicating direction. Arrow in Circle An arrow indicating direction. Chats Two speech clouds. Facebook The logo for the Facebook social media service. Information The letter 'i' in a circle. Instagram The logo for the Instagram social media service. External Link An arrow entering a square. Linkedin The logo for the LinkedIn social media service. Location Pin A map location pin. Mail An envelope. Menu Three horizontal lines indicating a menu. Minus A minus sign. Telephone An antique telephone. Plus A plus symbol indicating more or the ability to add. Search A magnifying glass. Twitter The logo for the Twitter social media service. Youtube The logo for the YouTube video sharing service.