Nonparametric Generation of Synthetic Data Using Copulas
dc.contributor.advisor | Laniado Rodas, Henry | spa |
dc.contributor.advisor | Rivera Agudelo, Juan Carlos | spa |
dc.contributor.author | Restrepo Lopera, Juan Pablo | |
dc.coverage.spatial | Medellín; Lat: 06 15 00 N (degrees minutes), 6.2500 (decimal degrees); Long: 075 36 00 W (degrees minutes), -75.6000 (decimal degrees) | eng |
dc.creator.degree | Magíster en Matemáticas Aplicadas | spa |
dc.creator.email | jurest82@eafit.edu.co | spa |
dc.creator.grantor | This research was funded through call 852-2019 of the Ministry of Science, Technology and Innovation of the Republic of Colombia (MinCiencias), which supported the project with code 1216-852-72082, titled “Descriptive and predictive analysis of the cement and concrete production process” | spa |
dc.date.accessioned | 2023-05-16T21:34:10Z | |
dc.date.available | 2023-05-16T21:34:10Z | |
dc.date.issued | 2023 | |
dc.description.abstract | This article presents a novel nonparametric approach to generating synthetic data using copulas, which are functions that describe the dependency structure of the real data. The proposed method addresses several challenges faced by existing synthetic data generation techniques, such as preserving the complex multivariate structures present in real data. Because it uses all the information in the real data and verifies, through homogeneity tests, that the generated synthetic data follows the same behavior as the real data, the method improves on existing techniques. It is also easy to implement and interpret, making it a valuable tool for addressing class imbalance in machine learning models, improving the generalization capabilities of deep learning models, and anonymizing information in the finance and healthcare domains, among other applications. | spa |
dc.identifier.ddc | 006.312 R436 | |
dc.identifier.uri | http://hdl.handle.net/10784/32480 | |
dc.language.iso | spa | spa |
dc.publisher | Universidad EAFIT | spa |
dc.publisher.department | Escuela de Ciencias Aplicadas e Ingeniería. Departamento de Ciencias Matemáticas | spa |
dc.publisher.place | Medellín | spa |
dc.publisher.program | Maestría en Matemáticas Aplicadas | spa |
dc.relation.uri | https://doi.org/10.3390/electronics12071601 | spa |
dc.relation.uri | https://github.com/jurest82/SyntheticDataCopulas | spa |
dc.rights | All rights reserved | spa |
dc.rights.accessrights | info:eu-repo/semantics/openAccess | spa |
dc.rights.local | Open access | spa |
dc.subject | Generación de datos sintéticos | spa |
dc.subject | Aumento de datos | spa |
dc.subject | Test de homogeneidad | spa |
dc.subject | Cópulas empíricas | spa |
dc.subject | Estadística no paramétrica | spa |
dc.subject.keyword | Synthetic data generation | spa |
dc.subject.keyword | Data augmentation | spa |
dc.subject.keyword | Homogeneity test | spa |
dc.subject.keyword | Empirical copulas | spa |
dc.subject.keyword | Nonparametric statistics | spa |
dc.subject.lemb | MATEMÁTICAS PARA INGENIEROS | spa |
dc.subject.lemb | DATOS ESTADÍSTICOS | spa |
dc.subject.lemb | MINERÍA DE DATOS | spa |
dc.title | Nonparametric Generation of Synthetic Data Using Copulas | spa |
dc.type | masterThesis | eng |
dc.type | info:eu-repo/semantics/masterThesis | eng |
dc.type.hasVersion | acceptedVersion | eng |
dc.type.local | Tesis de Maestría | spa |
dc.type.spa | Artículo | spa |
Files
Original bundle
1 - 3 of 3
- Name:
- carta_aprobacion_trabajo_grado_eafit.pdf
- Size:
- 78.09 KB
- Format:
- Adobe Portable Document Format
- Description:
- Thesis approval letter
- Name:
- JuanPablo_RestrepoLopera_2023.pdf
- Size:
- 8.83 MB
- Format:
- Adobe Portable Document Format
- Description:
- Thesis
- Name:
- formulario_autorizacion_publicacion_obras.pdf
- Size:
- 1.31 MB
- Format:
- Adobe Portable Document Format
- Description:
- Publication authorization form