RT info:eu-repo/semantics/article T1 On Detecting and Removing Superficial Redundancy in Vector Databases A1 Castro-García, Noemí de A1 Muñoz Castañeda, Ángel Luis A1 Fernández Rodríguez, Mario A1 Carriegos Vieira, Miguel A2 Algebra K1 Biblioteconomía K1 Python K1 MapReduce K1 Ciberseguridad K1 Optimización de datos K1 1203.12 Bancos de Datos AB A mathematical model is proposed in order to obtain an automatized tool to remove any unnecessary data, to compute the level of the redundancy, and to recover the original and filtered database, at any time of the process, in a vector database. This type of database can be modeled as an oriented directed graph. Thus, the database is characterized by an adjacency matrix. Therefore, a record is no longer a row but a matrix. Then, the problem of cleaning redundancies is addressed from a theoretical point of view. Superficial redundancy is measured and filtered by using the 1-norm of a matrix. Algorithms are presented by Python and MapReduce, and a case study of a real cybersecurity database is performed. PB Hindawi SN 1024-123X LK http://hdl.handle.net/10612/11344 UL http://hdl.handle.net/10612/11344 NO Mathematical Problems in Engineering, Vol. 2018, NO 14 p. DS BULERIA. Repositorio Institucional de la Universidad de León RD 19-abr-2024