Profile Image

Massimiliano Falzari

AI Engineer & Researcher | LLMs Reinforcement Learning MLOps

About Me

I bridge the gap between AI research and production-grade engineering. With an MSc in AI from the University of Groningen and a published background in Deep Reinforcement Learning, I focus on building systems that are as theoretically sound as they are scalable.

Currently, at Wide Group, I design multimodal document processing systems using LangGraph, Kafka, and AWS across a 30+ microservice architecture. My work centers on making LLMs reliable and observable in production environments.

Beyond the code, three years as a Teaching Assistant for Deep Learning and Cognitive Robotics sharpened my ability to translate complex concepts into clear, actionable insights. A skill I carry into every architectural decision I make.

Massimiliano enjoying a drink outdoors Massimiliano with a friend in winter

I thrive on building AI solutions from the ground up—balancing high-level research with the practicality required for real-world impact.

When I'm not immersed in the world of AI, I’m usually staying active with volleyball and swimming, or strategizing over a game of chess.

Publications

Fisher-Guided Selective Forgetting: Mitigating The Primacy Bias in Deep Reinforcement Learning (2025)

  • Introduced a novel method using the Fisher Information Matrix (FIM) to identify and mitigate the primacy bias.
  • Demonstrated improved learning efficiency and performance by selectively adjusting network weights, preventing early experiences from dominating the learning process.
View ArXiv View Abstract

Upside-Down Reinforcement Learning for More Interpretable Optimal Control (2024)

  • Implemented an innovative Upside-Down Reinforcement Learning (UDRL) framework using tree-based ML methods.
  • Focused on interpretable approaches to Reinforcement Learning, demonstrating their viability in optimal control tasks.
  • Contributed to the field of interpretable RL, showing tree-based methods can offer performance on par with NNs while being more transparent.
View ArXiv

The Primacy Bias Through the Lens of the Fisher Information Matrix (2024)

  • Analysed and explained the Primacy Bias phenomena by using the Fisher Information Matrix (FIM).
  • Demonstrated how the FIM can identify critical memorization phases during training.
  • Connected the Primacy bias to the concept of Transfer Learning
View Abstract View Thesis

Autoencoder-based Deep Reinforcement Learning for ground-level walking of Human Musculoskeletal models

  • Investigated the effectiveness of using autoencoders to improve Deep Reinforcement Learning.
  • Compared undercomplete and variational autoencoders, with PPO+IL achieving best results with the undercomplete one.
  • Demonstrated how autoencoders compress high-dimensional state spaces, improving learning efficiency in complex environments.
View Thesis

Projects

Thesis Cover

The Primacy Bias Through the Lens of the Fisher Information Matrix

  • Source code available on Github for implementing forgetting mechanisms in Neural Networks.
  • Implementation uses PyTorch, Tianshou, and Gymnasium.
Upside-Down

Upside-Down Reinforcement Learning for More Interpretable Optimal Control

  • Source code available for a novel Upside-Down RL framework using tree-based ML methods.
  • Deployed demo to experiment and understand the capabilities of tree based methods in a UDRL setting
  • Implementation focuses on interpretable approaches to reinforcement learning.
Bachelor Thesis

Autoencoder-based Deep Reinforcement Learning for ground-level walking of Human Musculoskeletal models

Word inflection

Investigating LoRA for Cross-Lingual Adaptation in Word Inflection

  • Code available for fine-tuning ByT5 model using Low-Rank Adapters (LoRA).
  • Demonstrated state-of-the-art results for morphological inflection.
  • Showed benefits of pre-training with related languages.
Volleyball Website

Volleyball Statistics Website

  • Source code for a website to manage and visualise volleyball statistics.
  • Implementation includes data management and trend analysis.
  • Data organization for faster retrieval.
BEGAN

Boundary Equilibrium GAN

  • Source code replicating the Boundary Equilibrium GAN paper.
  • Implementation uses PyTorch, TorchVision, Numpy, and CUDA.
  • Addresses mode collapse issues in GANs by promoting diverse sample generation.
Echo State Network

Echo State Network for Percussion-based music generation

  • Source code for a project combining Particle Swarm Optimization and Echo State Networks.
  • Project uses Python, NumPy, SciPy, and Matplotlib.
  • Implements a method to enhance music generation tasks.

Experience

Widegroup Logo

AI Engineer & Data Scientist

Widegroup | April 2025 - Current

  • Develop end-to-end AI solutions for insurance document processing
  • Multimodal LLM systems (LangGraph, LangChain) for information extraction and analysis
  • Agent-based systems for SQL, scraping, and automation (MCP, Langfuse, LLM-as-a-judge)
  • AWS infrastructure (CDK, Lambda, ECS, MSK/Kafka) + Event-Driven architecture
  • Focus on LLMOps, DevOps, and developer tooling for 30+ microservices
BG Logo

AI Engineer

Bluegreen | March 2025 - June 2025

  • Architected and developed a Retrieval-Augmented Generation (RAG) system.
  • Integrated Langchain and LangGraph for advanced AI workflows.
  • Built interactive UIs using Streamlit.
  • Deployed with FastAPI.
ENG Logo

AI Researcher

Engineering | December 2024 - April 2025

  • Research & Innovation projects at both national and international levels
  • Implement innovative methodologies and tools to optimize projects outcomes
  • Design, develop and finetune prototypes in the context of LLMs
  • Focus on MLOps and LLMOps.
RUG Logo

Teaching Assistant

University of Groningen | April 2021 - August 2024

  • Coordinated and organized courses
  • Taught seminar groups
  • Graded assignments and exams
  • Supervised courses in:
    • Languages and Machine
    • Cognitive Robotics
    • Deep Learning
    • Reinforcement Learning
    • Deep Reinforcement Learning
Project 1

IA Consultant and Backend Developer

Basis | April 2021 - September 2021

  • Developed backend solutions
  • Brainstormed ideas on data treatment
  • Proposed potential Machine Learning approaches
  • Coordinated with frontend developers