Availability: Freelance, Fixed-term, Permanent — on-site or remote

Data Engineer focused Quality, Reliability & Scalability

Database design and management, Application development for data collection, Data processing and formatting for analysis, Data analysis, Design of robust ETL/ELT pipelines, Airflow orchestration

scheduling & Orchestration
Apache Airflow, CRON job
Data analysis
R, Stata, SPSS, python
Big Data
Spark, Hadoop, Hive
Data collection
Kobo, Odk, Excel (masques), Redcap, CSPRO
Infra: Docker · Kubernetes · Terraform · GCP/AWS · GitHub Actions

Recent projects

Voir tout sur GitHub

Student Social Media data analysis (Data)

Complete data analysis project on social media usage among students. The project covers the entire data science workflow, including:
→ Data ingestion (Kaggle API & CSV)
→ Processing and Visualization (pandas, matplotlib, sns),
→ Predictive Modeling (sklearn),
→ Results Communication (ipywidgets).

  • API
  • Kaggle
  • Python
  • pandas
  • matplotlib
  • sns
  • sklearn
  • ipywidgets
705 students Most used: Instagram

Suivi de candidature (python dev)

Jobs applications data entry allowing to follow and plan well job posts we want to apply
Key advantages
Be informed job application deadlines
Found or Search make easy
→ save time and Be productive.

  • Python
  • Inno setup
  • PyQt5/6
Available for free Open source

Jags modeling - R (Data)

Define and build your jags model with rjags:
key Features:
Model specification in JAGS language,
Data preparation for JAGS,
MCMC sampling with rjags,
Posterior analysis and visualization with coda and bayesplot,
Convergence diagnostics with Gelman-Rubin and traceplots.

  • R
  • Jags
  • Traceplots
  • Gelman-Rubin
  • MCMC
  • Bayesian modeling
2 diagnotics ways explained 6 steps to improve convergence

Package libelizeKobo R (Data)

R package to label and recode KoboToolbox datasets.
Key features:
Import KoboToolbox data and metadata,
Label variables and values,
Recode variables based on metadata,
→ Export cleaned datasets for analysis.

  • R
  • Packages
  • KOBO
  • ONA
  • ODK
  • mobile data collection
Lisibilité des datasets data transformation

Compétences

Langages & Outils

  • Python, SQL, Bash, R, java, VBA
  • Airflow, Spark, Kafka, Hadoop, MapReduce
  • Postgres, MySQL, MongoDB, Cassandra
  • Rstudio, Stata, SPSS
  • Looker Studio, Tableau, IBM cognos, Power BI

Infra & DevOps

  • Docker, Docker Compose, Kubernetes
  • CI/CD (GitHub Actions), Terraform
  • AWS (S3, EMR, Glue), GCP (GCS, Dataflow, BigQuery)
  • Security by design, IAM, Secrets

Méthodes

  • Architecture Data Lake/Lakehouse
  • Analyse & modélisation
  • Data quality & tests
  • Monitoring & Observabilité

Experience

  1. Junior Data Engineer — TRIMOM/MURAZ

    Data lead for the TRIMOM project, a health initiative by the MURAZ center studying triple infections (HIV, Syphilis, HBV) in pregnant women at healthcare centers in Burkina Faso. My role is to provide the best data solutions for project monitoring and success. My responsibilities include:
    — Designing the architecture of multiple cohort databases considering all relationships and objectives;
    — Developing data collection applications tailored for each service (consultation or laboratory);
    — Training field investigators;
    — Monitoring data quality;
    — Creating reports to track field investigator activity (number of submitted forms, errors detected in received forms);
    — Producing KPI reports for the project team;
    — Building ELT pipelines to automate reporting systems updates and cloud storage for the project team;
    Analyzing data for various scientific publications.

  2. Data Manager — RESTHIV/MURAZ

    RESTHIV is a national project aimed at improving care for people living with HIV (adults, children, adolescents) in partnership with SP-CNLST. As a Data Manager at the Center for Methodologies and Data Management (CMGD), my tasks included:
    — Creating and managing databases;
    — Developing mobile data collection applications;
    — Monitoring collection activities using Shiny dashboards;
    — Cleaning datasets for the team's Data Analyst.

  3. VBA Application Developer — NATIONAL POLICE (Freelance)

    To facilitate daily accident data entry, monthly, quarterly, and yearly report generation, and geographic tracking of KPIs, I developed a solution using Excel + VBA. I designed a VBA application with integrated input forms, automated report creation, and an interactive dashboard, replacing manual templates.

  4. Data Manager — ALGO-VIH/MURAZ

    ALGO-VIH aimed to improve HIV seropositivity diagnostics by testing 10 screening methods to identify the best ones. As Data Manager, I oversaw the full data collection and digitization process. I developed data collection tools (VBA & KOBO), monitored data quality (double-entry system, Shiny dashboard), and cleaned databases for analysis.

  5. Data Analyst — ONSP

    The National Public Health Observatory (ONSP) conducted a nationwide study on Burkinabé preferences for Insecticide-Treated Nets. The CMGD at MURAZ handled the data analysis. As part of the analyst team, I designed insight tables for the analysis report, performed STATA data analyses, and assisted with technical report writing.

  6. Data Analyst — Research Assistant MURAZ

    At the MURAZ research center, I supported researchers in data analysis and scientific manuscript writing. My role included preparing existing datasets to answer research questions, assisting with writing, proofreading, and editing manuscripts for journal submission.

  7. VBA Application Developer — MURAZ Center

    As a VBA developer, I created applications for the entomology department to facilitate mosquito data collection. The applications managed the full collection process, access rights, and included an integrated dashboard to monitor key indicators.

Articles & Talks

Tous les posts

Data Governance and Compliance

Practices and procedures to ensure data quality, security, and responsible use: Use cases with public health data

2025 • Sharing experiences

Suivi de candidature

Keep track of the positions you have already applied for or are planning to apply for.

2025 • improve work research

Les APIs

APIs work simply explained.

2025 • data extraction

Let's Work Together

Do you have a data need (ETL, data management or analysis, development, dashboards, optimization)? Write to me and I will get back to you very soon.

Based in Burkina Faso • Available for remote projects and open to any collaboration opportunities