Explore our synthetic data demonstrations showcasing how Nova Synthetic successfully transformed existing datasets into privacy-preserving synthetic versions. These demos prove our technology's ability to maintain statistical integrity while protecting sensitive information.
See how we successfully applied our
synthetic data technology to real-world datasets
Transforming data access while preserving privacy across industries
In today's world, access to quality information is an invaluable resource for science, medicine, and technological innovation. However, there is a challenge that transcends borders: How can we make the most of data without putting people's privacy at risk? This tension is not exclusive to Costa Rica or Latin America; it is a global challenge.
At Nova Synthetic, we have embraced this challenge as our mission. Our work focuses on creating synthetic datasets that preserve the statistical richness of real data while eliminating any link to specific individuals. In this way, institutions of all types can innovate, research, and generate solutions without exposing sensitive information.
We recently developed a synthetic dataset based on the historic Diabetes 130 – US Hospitals (1999–2008), comprising over 100,000 clinical records. The objective was clear: preserve the statistical fidelity of the original data while ensuring minimal risk of re-identification.
The results exceeded international standards:
In other words, we have data with the same utility as the originals for research and development, but without the risk of compromising any patient's confidentiality.
Trust in data is not achieved solely through technical metrics. That's why every project at Nova Synthetic is designed to align with international regulatory frameworks such as HIPAA, GDPR, and ISO/IEC 27559, in addition to complying with local regulations in Latin America.
This approach allows us to deliver auditable, secure datasets ready to be used by hospitals, universities, insurance companies, startups, and research laboratories, within a framework of governance and responsibility.
The impact of synthetic data is not limited to the medical sector. These datasets open new opportunities in:
From Costa Rica, Nova Synthetic seeks to demonstrate that it is possible to innovate responsibly. Our work is not just a technical contribution, but a commitment to a future where data becomes an engine of economic, social, and scientific progress.
We firmly believe that Latin America can lead this transformation. With reliable synthetic datasets, it is possible to boost public health research, strengthen the competitiveness of our institutions, and open the door to global collaborations that previously seemed unattainable.
At Nova Synthetic, we are convinced that synthetic data is not an alternative for the future, but a present tool that is already changing the way we research, innovate, and build prosperity for our communities.
Transforming fraud detection while protecting customer privacy in the financial sector
Financial fraud is one of the greatest challenges of our time. Every year, thousands of people and organizations suffer losses from increasingly sophisticated fraudulent practices. However, researching and developing solutions against fraud presents an obvious difficulty: the real data containing these signals is usually highly sensitive and protected by strict regulations.
At Nova Synthetic, we believe this challenge should not hinder innovation. Our team has demonstrated that it is possible to create high-fidelity synthetic datasets that reproduce the statistical patterns of fraud, without including personal information from any customer. In this way, research and the financial industry can advance safely and responsibly.
Our most recent work focused on the international reference dataset Bank Account Fraud (BAF). The objective was clear: preserve the complexity and natural imbalance of bank fraud data, but under minimal reidentification risk.
The results speak for themselves:
In simple terms, this is ideal data for training and validating fraud detection models, with the peace of mind that no real customer is at stake.
Security is not measured only in numbers. Every Nova Synthetic project is designed to comply with the most demanding regulations, including:
This makes Nova Synthetic a reliable ally for banks, fintechs, insurance companies, and regulatory entities, both in Costa Rica and throughout Latin America.
Synthetic data applied to fraud not only solves a technical problem but generates strategic advantages for the entire sector:
From Costa Rica, Nova Synthetic works so that Latin America positions itself as a reference in responsible innovation with data. The development of quality synthetic datasets not only drives the fight against fraud but opens the door to a more solid, secure, and competitive financial ecosystem.
We believe that the region's future depends on finding the balance between technology, ethics, and trust. Synthetic data is a key piece to achieve that balance and ensure that prosperity is shared.
At Nova Synthetic, we are convinced that synthetic data represents not just a technological solution, but a commitment to a future where financial innovation and customer protection go hand in hand, building a more secure and prosperous financial landscape for all.
Advancing cancer research while protecting patient privacy through revolutionary synthetic data technology
Breast cancer is one of the greatest public health challenges in the world. Medical research requires high-quality data to discover patterns, test hypotheses, and develop more effective treatments. However, this data is usually extremely sensitive and protected by regulations that limit its access.
At Nova Synthetic, we believe that innovation and privacy should not be in conflict. That's why we have taken an important step: generating a high-quality synthetic dataset based on the renowned Rotterdam oncological dataset.
The result of this project is not just a set of numbers: it is proof that we can create realistic and reliable clinical datasets that preserve the statistical richness of the original data without exposing the identity of any patient.
Outstanding results:
In simple terms, this means that researchers and healthcare professionals can work with synthetic data that behaves like real data, but without compromising patient confidentiality.
Synthetic datasets like this enable researchers to:
In a field like oncology, where time can make the difference between life and death, this ability to generate and share reliable information opens new possibilities for global collaboration.
Our advanced synthetic data generation pipeline utilized the SynthD system with MOSTLY AI SDK 4.7.8, featuring:
Beyond the metrics, this achievement reflects a vision: data science in service of life. With each advance in synthetic data, we move closer to a future where:
At Nova Synthetic, we know that the fight against cancer depends not only on medicine but also on the ability to generate knowledge from data. With this project, we demonstrate that it is possible to combine cutting-edge technology, ethical responsibility, and strategic vision to transform medical research.
This is one more step in our commitment: to make innovation protect and enhance human life, in Costa Rica, Latin America, and the world.