Research on Data Currency Rule and Quality Evaluation
Keywords:data currency, currency rule, data quality, quality evaluation, parallel algorithm
Data currency is a temporal reference of data, it reflects the degree to which the data is current with the world
it models. Currency rule is a formal rule extracted from the data set and reflecting the currency order of the
data tuples, it can be used for both data repairing and currency quality evaluation. Based on the research of data
currency repairing, the basic form of currency rule is extended, and parallel rule extraction and update algorithms
are proposed to meet the requirement of running on dynamic data sets. Besides, four data currency quality
evaluation models are proposed and verified by experiments. The performance test show that the efficiency
of parallel algorithms is significantly improved, the rules compliance mean(CM2) model based on extended
currency rule has the highest average precision. The extended currency rules not only improve the efficiency
and adaptability, but also provide more valuable features for data quality evaluation.
Copyright terms are indicated in the Republic of Lithuania Law on Copyright and Related Rights, Articles 4-37.