The Data Quality Assessment Technique of Personal Data Operators Register

S. E. Dukhovenskiy, P. Pushkin, E. Nikulchev

Abstract


The work considers the issue of completeness and accuracy of personal data operators register published by Federal Service for Supervision in the Sphere of Telecom, Information Technologies and Mass Communications. The set of tools was designed and the data quality assessment technique was determined. The suggested technique was implemented via the information and analytical system including ETL module, metadata repository and check execution module. An experimental test of the implemented system covered a random sample of 1671 currently active operators. The resulting data quality assessment highlighted inaccuracies in records of 12% operators, including data incompleteness, data inconsistency, the presence of duplicates and some outliers beyond acceptable range. Revealed failures can be used to improve the electronic document interchange between operator community and Federal Service for Supervision in the Sphere of Telecom, Information Technologies and Mass Communications taking into account the recommendations, presented in the paper, such as electronic form improvements and operators’ self-check as part of document workflow.


Full Text:

PDF (Russian)

References


About Personal Data, Federal law of 27.07.2006 No. 152-FZ, http://pravo.gov.ru/proxy/ips/?docbody&nd=102108261 (in Rus)

Personal data operators register, https://pd.rkn.gov.ru/operators-registry/operators-list/ (in Rus)

V.P. Los, E.V., Nikulchev P.Y. Pushkin, A.M. Rusakov, “Infor-mation and analytical system for monitoring the compliance of personal data operators with the requirements of the legislation,“ Problems of information security. Computer systems, no. 3, pp. 16-23, 2020. (in Rus)

P.Y. Pushkin, A.M. Rusakov, “Results of automatic mining individual fields of personal data operators register,“ International Journal of Open Information Technologies, vol. 9, no. 1, pp. 37-47, 2021. (in Rus)

E.V. Nikulchev, P.Y. Pushkin, A.M. Rusakov, “Recommenda-tions for certain attribute filling of the personal data processing notifications by the university operator community“ in Information security of the educational process subject personality in the digital information and educational environment: collection of scientific articles, Moscow: Gubkin Russian State University of Oil and Gas, pp. 318-325. 2021. (in Rus)

Report on the activities of the Authorized Service for the Protec-tion of the Rights of Personal Data Subjects for 2019, https://rkn.gov.ru/docs/Otchet_UO-2019_new.pdf (in Rus)

Report on execution of the Plan and performance indicators by Federal Service for Supervision in the Sphere of Telecom, Infor-mation Technologies and Mass Communications in 2022, https://rkn.gov.ru/docs/doc_3806.pdf (in Rus)

A. A. Ilyin, “Automated technology for designing a data model when building an information and analytical system ,” Bulletin of Russian Universities. Mathematics, vol. 13, no. 1, pp. 89-90, 2008. (in Rus)

M. Borisyak, A. Ryzhikov, A. Ustyuzhanin, D. Derkach, F. Ratnikov, O. Mineeva, “(1+ ε)-class classification: an anomaly detection method for highly imbalanced or incomplete data sets,” The Journal of Machine Learning Research, vol. 21, no. 1, pp. 2768-2789, 2020.

W. Fan, “Data quality: From theory to practice,” ACM SIGMOD Record, vol. 44, no. 3, p. 7-18, 2015.

P. Oliveira, F. Rodrigues, P. R. Henriques, “A formal definition of data quality problems,” in Proceedings of the 2005 International Conference on Information Quality, MIT, 2005

J. Wang, Y. Liu, P. Li, Z. Lin, S. Sindakis, S. Aggarwal, “Over-view of data quality: examining the dimensions, antecedents, and impacts of data quality,” Journal of the Knowledge Economy, 2023 https://doi.org/10.1007/s13132-022-01096-6.

D. S. Sirisuriya, “A comparative study on web scraping,“ 2015. Available: http://ir.kdu.ac.lk/handle/345/1051.


Refbacks

  • There are currently no refbacks.


Abava  Кибербезопасность IT Congress 2024

ISSN: 2307-8162