Antoine Pinto
Freelance - AI Engineer | Data Scientist
France
Data Scientist and AI Engineer specializing in the design, development, and deployment of enterprise-grade Generative AI solutions. Experienced in building custom RAG architectures, leading Gen AI projects from needs assessment to delivery, and serving as the technical referent for development teams. Portfolio of multiple self-developed AI applications and published Python packages.
Project Portfolio
Pepit'Auto
Application for easily searching for used cars on the French car market.
Gen AI Applications (B2B)
Development and maintenance of several generative AI applications as part of AI development of various companies.
Air Traffic Forecasting
Design and implementation of the first air traffic forecasting algorithm for a major international airport operator.
Trading Bot
Implementation of an algorithm that automatically buys and sells financial assets based on the predictions of a ML model.
Docu Talk - Document Chat Bot
Custom Decision Trees - ML Library
Python library that enables building Decision Trees and Random Forests with custom splitting criteria.
String Pair Finder - Python Library
String matching algorithm designed to match strings by similarity.
Sports Betting Algorithms
Algorithms to optimize sports betting decisions, based on the expected gain calculated via the training of ML models.
Staty Soccer - Journalist Bot
Algorithm for detecting the most relevant recent football statistics, creating multimedia content and publishing it.
MapReduce Clustering for Big Data
Co-authoring and development of a clustering method suitable for large data sets, based on the MapReduce method.
Technical Skills
- Dedicated a minimum of 4 hours per day to Python development
- Developed 10+ backend applications using FastAPI / Streamlit
- Applied object-oriented programming (OOP) concepts: classes, inheritance, delegation, etc
- Created several Python libraries (custom-decision-trees, string-pair-finder, easyenvi, etc.)
- Parallelization; Multiprocessing
- Delivered "dvanced Python" course to Master's students
- Proficient in SQL with PostgreSQL, MySQL, SQLite
- Skilled in NoSQL using MongoDB
- Experienced with Big Query, Cloud Storage (GCP)
- Delivered university course “Advanced SQL” (6 sessions of 3 hours each)
- Served as Generative AI referent in a company of 50+ developers
- Mastered generation and embedding models via APIs: OpenAI; Gemini; Mistral
- Implemented RAG (Retrieval Augmented Generation) architecture
- Applied advanced techniques such as Chain of Thought, Function Calling, and prompt engineering
- Delivered training courses: "Generative AI for Developers" (2 x 4 hours) and "Generative AI for HRs" (4 hours)
- Mastered regression and classification algorithms: Decision Tree, Random Forest, XGBoost, Linear/Logistic Regression, KNN
- Developed the first traffic forecast algorithm for a major airport operator (representing 5.5% of world traffic)
- Developed Custom Decision Tress, a Python library for building customizable decision trees and random forests.
- Published a scientific paper on Clustering: MapReduce Clustering For Big Data - B.Ghattas, A.Pinto, S.Diao
- Scraped 10+ websites using requests, BeautifulSoup and occasionally Selenium
- Scheduled scraping execution and data storage in databases (Big Query, MongoDB, etc.)
- Implemented proxies for automated scraping via Cloud Run instances
- Automated and maintained scraping algorithms for long-term use
- Proficient in 15+ GCP services used via interface or API
- Automated Cloud Run job and service deployment & scheduling with Cloud Scheduler
- Optimized complex nested queries in Big Query
- Deployed and managed Cloud SQL relational database
- Administered users and service accounts via Cloud IAM
- Integrated APIs including Gemini, Translation Hub, Google Maps
- Managed additional services: Cloud Storage, Cloud Tasks, Vertex AI, Artifact Registry, Google Auth, Cloud Logging
- Applied Git systematically across all development projects.
- Collaborated in teams using branches, merge requests, and other Git workflows
- Developed CI/CD pipelines for automatic application deployment.
- Utilized nested submodules to modularize applications
- Mirrored repositories to external GitLab / GitHub instances.
- Founded and administered the GitLab Gen AI group for a company with over 50 developers.
- Developed 10+ Streamlit applications in professional settings
- Built Docu Talk (Streamlit version), a web app that lets you create custom Chat Bots from your PDF documents
- Mastered native frontend elements and applied advanced CSS customizations
- Secured deployed applications using authentication systems (passwords, SSO, MSAL)
- Optimized user experience, including ergonomics and background executions
- Deployed applications to Cloud Run, AppEngine, and Streamlit Cloud
- Developed 4 frontend applications/websites using Vibe Coding
- Developed Docu Talk (React version), a web app that lets you create custom Chat Bots from your PDF documents.
- Deployed applications to Cloud Run
- Other languages: R language; VBA; Pyspark
- Office Pack: Excel / PowerPoint / Word
- VS Code / Cursor
- Notion / Trello
- Azure: AzureOpenAI, Azure Blob Storage, Azure Entra ID
- AWS SES
- Docker
References
I've had the privilege of collaborating with leading organizations across various industries.
European Digital Group
Equancy
VINCI Airports
WEFY
Clarins Group
Paris-Dauphine University
Metsys
Institut de Mathématiques de Marseille