Andrea Vanessa Hurtado Quiceno

Logo

View the Project on GitHub andrea072693/Andrea_Vanessa_Hurtado_Quiceno_DataLab.github.io

Statistics and Data Science

Welcome to my GitHub portfolio! I am Andrea Vanessa Hurtado Quiceno, a PhD candidate at RPTU Kaiserslautern, specializing in Stochastic Analysis and Differential Geometry. Throughout my academic journey, I have engaged in projects ranging from Probability Theory to Statistical Modellisation, including the use of topological data analysis for machine learning. This portfolio serves as a showcase of my research, academic achievements, and projects, reflecting my commitment to pushing the boundaries of knowledge in mathematics and data science.

Personal Information

Education

Qualifications

Work Experience

Topological Data Analysis

Topological Descriptors for Machine Translation and Word Embeddings

Master 1 in Modelisation Statistique master’s project
Université Franche-Comté, Besançon (2022 – 2023)

This project involved exploring topological data analysis (TDA) techniques applied to machine translation and word embeddings. I developed methods to capture topological structures in high-dimensional data spaces, using persistent homology to analyze and compare linguistic datasets. The goal was to improve translation models by integrating topological descriptors, creating more robust and interpretable machine learning models.

Key Skills and Tools:

More detailed information and code for this project will be added soon!

C++ Project

Master 1 in Modelisation Statistique master’s project
Université Franche-Comté, Besançon (2022 – 2023)

Monopoly Game (Console Game)

This project involved creating a console-based version of the popular Monopoly board game using C++. The game simulates the traditional Monopoly experience, where players roll dice, purchase properties, and aim to bankrupt their opponents. The game was designed for 2 to 4 players, with each player starting with 500 “polycurrency”. Players could upgrade properties, pay rent, and interact with special game elements like the “Policier” box and prison.

Key Features:

Problem-Solving Approach:

Key Skills and Tools:

Player A arrives to the Police

Figure 1: Player A arrives to the Police

Player A is send to prison

Figure 2: Player A is send to prison

Linear Models

Master 1 in Modelisation Statistique projects
Université Franche-Comté, Besançon (2022 – 2023)

Linear Models Project (TP1)

This project tackled simple and multiple linear regression models to predict apartment prices and body fat percentages using various predictors. The analysis explored the effects of centering, scaling variables, and even creating custom regression functions for better understanding of the data.

Problem-Solving Approach:

Key Skills and Tools:

View the Full Report

View R Document

Linear Models Project (TP2)

This project involved using multiple linear regression models to predict ozone concentration based on meteorological data. The dataset included variables like temperature, cloud cover, wind speed, and ozone concentration from the previous day. The goal was to answer several key statistical questions, such as model adequacy, confidence interval estimation, and bias detection.

Key Questions Solved:

Key Skills and Tools:

View HTML TP2 report

View R Document

Linear Models Project (TP3)

This project focused on building and validating multiple linear regression models using the Ozone dataset. The primary goal was to predict maximum daily ozone concentration based on environmental variables such as temperature, cloud cover, wind speed, and ozone concentration from the previous day. I explored model assumptions, detected influential points, and applied model selection techniques, including AIC, BIC, and cross-validation, to choose the best performing model.

Key Skills and Tools:

View R Document

Data Analysis Lecture

Master 1 in Modelisation Statistique projects
Université Franche-Comté, Besançon (2022 – 2023)

Socioeconomic Differences Between Bourgogne and Franche-Comté Regions in 2012: A PCA Analysis

PCA Analysis Project

This project focused on analyzing socioeconomic disparities between Bourgogne and Franche-Comté using data from the 2012 national French census. Principal Component Analysis (PCA) was used to investigate relationships between demographic, employment, education, and housing variables across the two regions.

Problem-Solving Approach:

Communes Figure 3: Communes population in both region

Key Tools:

View R Document

Recommendations for Further Study:

A deeper analysis of communes in both regions, potentially including additional variables like salary, tax payments, and financial background to better understand economic conditions.

Singular Value Decomposition (SVD) for Image Reconstruction

This project demonstrates the application of Singular Value Decomposition (SVD) for image compression and reconstruction. Using a grayscale image of Montmartre, we employed SVD to reduce the image’s dimensionality while preserving its essential features.

Objectives:

Key Findings:

Tools Used:

View PDF Document

Simulation in R

Master 1 in Modelisation Statistique master’s project
Université Franche-Comté, Besançon (2022 – 2023)

This project simulates the random walk of a turtle on a 2D grid using Monte Carlo methods. The goal is to estimate the distribution of the number of revisits ((N_n)) to previously visited points for different step sizes (n = 100), (n = 1000), and (n = 10000).

Authors:

Problem Description

The turtle starts at the origin ((0,0)) and moves randomly in one of four directions: ((0,1)), ((1,0)), ((0,-1)), or ((-1,0)). After each step, we record whether the turtle revisits a previously visited point.

For each value of (n), we calculate (N_n), the number of revisits, and use Monte Carlo simulations to analyze the distribution of (N_n).

Steps:

  1. Simulate the random walk for different values of (n).
  2. Use Monte Carlo methods to estimate the distribution of revisits.
  3. Visualize the results using histograms and plots.

Key Aspects

R Packages Used

View HTML Simulations

View R code

Statistics Lecture

Master 1 in Modelisation Statistique master’s project
Université Franche-Comté, Besançon (2022 – 2023)

This project analyzes simulated data using statistical methods. The goal is to explore various aspects of the data, perform hypothesis testing, and build regression models to understand the underlying relationships.

Authors:

Key Aspects:

R Packages Used:

View HTML Statistics

View R code

New projects, coming soon!!!

Coursera courses