Aplicación de tiempos de viaje como semimétrica en k-means para la agrupación y optimización de rutas

Lasprilla Acevedo, Julián Felipe

Aplicación de tiempos de viaje como semimétrica en k-means para la agrupación y optimización de rutas

dc.contributor.advisor	Masmela Caita, Luis Alejandro
dc.contributor.author	Lasprilla Acevedo, Julián Felipe
dc.date.accessioned	2025-03-31T21:38:34Z
dc.date.available	2025-03-31T21:38:34Z
dc.date.created	2024-11-14
dc.description	Este trabajo explora la adaptación del algoritmo K-means para utilizar tiempos de viaje como una semimétrica, aplicada al análisis de datos georreferenciados en un contexto urbano, específicamente en la ciudad de Bogotá. Incorporando los tiempos de desplazamiento obtenidos a través de la API de Google Directions, el modelo permite formar clústeres que reflejan no solo la proximidad geográfica, sino también las condiciones de tráfico, haciendo el proceso de clusterización más realista en entornos urbanos complejos. Para evaluar la calidad de la agrupación se emplearon índices de validación como el índice de Silueta y el índice de Davies-Bouldin, los cuales indicaron configuraciones óptimas para los clústeres. Este enfoque contribuye a la planificación logística en la ciudad y abre oportunidades para futuras investigaciones. Entre las posibles extensiones se encuentran el uso de series de tiempo georreferenciadas para capturar patrones de tráfico a lo largo del día y la inclusión de distintos modos de transporte, como el público o los desplazamientos a pie. También se propone investigar variaciones en la semimétrica de tiempos de viaje, lo cual podría facilitar el uso de técnicas avanzadas como el clustering espectral, mejorando aún más el análisis de datos espaciales en aplicaciones urbanas.
dc.description.abstract	This work explores the adaptation of the K-means algorithm to use travel times as a semimetric, applied to the analysis of georeferenced data in an urban context, specifically in the city of Bogotá. By incorporating travel times obtained via the Google Directions API, the model allows the formation of clusters that reflect not only geographical proximity but also traffic conditions, making the clustering process more realistic in complex urban environments. To evaluate the quality of the clustering, validation indices such as the Silhouette index and the Davies-Bouldin index were employed, which indicated optimal configurations for the clusters. This approach contributes to logistical planning in the city and opens up opportunities for future research. Potential extensions include the use of georeferenced time series to capture traffic patterns throughout the day and the inclusion of different modes of transportation, such as public transit or walking. It is also proposed to investigate variations in the travel time semimetric, which could facilitate the use of advanced techniques such as spectral clustering, further enhancing the analysis of spatial data in urban applications.
dc.format.mimetype	pdf
dc.identifier.uri	http://hdl.handle.net/11349/94409
dc.language.iso	spa
dc.relation.references	C. C. Aggarwal and C. K. Reddy. Data Clustering: Algorithms and Applications. CRC Press, 2013. Disponible en Taylor & Francis. Describe métricas y semimétricas en clustering, incluyendo ejemplos de datos espaciales. pp. 205–210.
dc.relation.references	D. Arthur and S. Vassilvitskii. k-means++: The advantages of careful seeding. Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms, 2007. pp. 1027–1035.
dc.relation.references	P. Berkhin. A survey of clustering data mining techniques. In Group Technology and Cellular Manufacturing. Springer, 2006. pp. 25–71.
dc.relation.references	C. M. Bishop. Pattern Recognition and Machine Learning. Springer, 2006. pp. 424–430.
dc.relation.references	D. L. Davies and D. W. Bouldin. A cluster separation measure. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1(2):224–227, 1979.
dc.relation.references	O. Dorabiala, D. V. Dabke, J. Webster, N. Kutz, and A. Aravkin. Spatiotemporal k-means. arXiv, 2022. Available at https://arxiv.org/abs/2211.05337.
dc.relation.references	G. Gan, C. Ma, and J. Wu. Data Clustering: Theory, Algorithms, and Applications. SIAM, 2007. pp. 153–157.
dc.relation.references	M. F. Goodchild and P. A. Longley. Geographic Information Systems and Science. John Wiley & Sons, 2005.
dc.relation.references	A. K. Jain, M. N. Murty, and P. J. Flynn. Data clustering: A review. ACM Computing Surveys, 31(3), 1999. pp. 264–323.
dc.relation.references	A. Kassambara. Practical Guide to Cluster Analysis in R: Unsupervised Machine Learning, volume 1 of Multivariate Analysis. Sthda, 2017. pp. 36–45.
dc.relation.references	S. P. Lloyd. Least squares quantization in pcm. IEEE Transactions on Information Theory, 28(2), 1982. pp. 129–137.
dc.relation.references	P. J. Rousseeuw. Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics, 20, 1987. pp. 53–65.
dc.relation.references	S. Shekhar and S. Chawla. Spatial Databases: With Application to GIS. Prentice Hall, 2003. Disponible en Open Library. Discute el uso de semimétricas en GIS y la asimetría en tiempos de viaje. pp. 321–325.
dc.relation.references	M. N. S. R. Slava Kisilevich, Florian Mansmann. Spatio-temporal clustering. In O. Maimon and L. Rokach, editors, Data Mining and Knowledge Discovery Handbook. Springer, Boston, MA, 2005. p. 855.
dc.relation.references	P.-N. Tan, M. Steinbach, A. Karpatne, and V. Kumar. Introduction to Data Mining. Pearson, 2nd edition, 2019. pp. 487–490.
dc.relation.references	S. K. Thompson. Sampling. Wiley, 3rd edition, 2012. p. 59.
dc.relation.references	J. Zhou and L. Keller. Machine Learning: A Probabilistic Perspective. MIT Press, 2018. pp. 123–127.
dc.relation.references	JulianLasprilla1. Monografia- aplicación de tiempos de viaje como semimétrica en k-means para la agrupación y optimización de rutas, 2024. Disponible en GitHub. https://github.com/ JulianLasprilla1/Monografia.git.
dc.rights.acceso	Abierto (Texto Completo)
dc.rights.accessrights	OpenAccess
dc.subject	K-means
dc.subject	Semimétrica
dc.subject	Datos geoespaciales
dc.subject	Tiempos de viaje
dc.subject	Clusterización urbana
dc.subject	API google direction
dc.subject.keyword	K-means
dc.subject.keyword	Semimetric
dc.subject.keyword	Geospatial data
dc.subject.keyword	Travel times
dc.subject.keyword	Urban clustering
dc.subject.keyword	API Google Direction
dc.subject.lemb	Matemáticas -- Tesis y disertaciones académicas
dc.subject.lemb	Algoritmos
dc.subject.lemb	Geografía -- Análisis espacial
dc.subject.lemb	Análisis cluster
dc.title	Aplicación de tiempos de viaje como semimétrica en k-means para la agrupación y optimización de rutas
dc.title.titleenglish	Application of travel times as a semimetric in k-means for route clustering and optimization
dc.type	bachelorThesis
dc.type.coar	http://purl.org/coar/resource_type/c_7a1f
dc.type.degree	Monografía
dc.type.driver	info:eu-repo/semantics/bachelorThesis

Archivos

Bloque original

Mostrando 1 - 2 de 2

Nombre:: LasprillaAcevedoJulianFelipe2024.pdf
Tamaño:: 5.52 MB
Formato:: Adobe Portable Document Format

Descargar

Nombre:: Licencia de uso y autorización.pdf
Tamaño:: 218.2 KB
Formato:: Adobe Portable Document Format

Descargar

Bloque de licencias

Mostrando 1 - 1 de 1

Nombre:: license.txt
Tamaño:: 7 KB
Formato:: Item-specific license agreed upon to submission
Descripción:

Descargar

Colecciones

Matemáticas