Skip to content

Data Science & Analytics

Open-source tools cited in Nature, Cancer Discovery, and 51 other peer-reviewed publications

The Problem

Researchers spend hours wrestling with geographic data that should take minutes. ZIP code lookups require expensive APIs. Distance calculations need manual geocoding. Demographic overlays mean stitching together multiple data sources. Time that should go toward research goes toward data prep instead.

Gavin Rozzi builds open-source tools that eliminate this friction. His R package zipcodeR has been cited in 53+ peer-reviewed publications—including Nature and Cancer Discovery—enabling breakthrough research in public health, epidemiology, and environmental science.

How This Informs Policy Decisions

These technical capabilities translate directly into better policy outcomes. The NJ HOMES Choice Tool uses geospatial analysis to help 564 municipalities understand their affordable housing obligations. The dashboards we build at DCA don't just display data—they inform decisions about housing allocation, compliance tracking, and resource distribution across the state.

When cancer researchers need to analyze ZIP code-level data, zipcodeR eliminates weeks of data preparation. The package has been cited in studies of childhood cancer clusters, environmental health disparities, and COVID-19 outcomes. Tools built for one purpose end up enabling research that informs public health policy nationwide.

At DCA, we use the same analytical approaches to measure program effectiveness. If a housing program isn't reaching its target communities, the data shows it. If compliance reporting takes too long, we redesign the system. Evidence-based decisions require evidence infrastructure, and building that infrastructure is core to what I do.

Data Science Projects

NJ Civil Service Navigator logo with compass design

NJ Civil Service Navigator

Web platform making 5,128+ NJ Civil Service job specifications searchable and accessible for job seekers, HR professionals, and hiring managers.

workforce modernizationgovernmentcivic tech
NJ HOMES Choice Tool homepage with Unit Mix Planning interface

NJ HOMES Choice Tool

Interactive planning tool implementing A4/S50 affordable housing calculations for all 564 municipalities, supporting NJ HOMES grantmaking and municipal compliance.

affordable housingplanning toolcompliance
Winter Termination Program Self-Certification electronic form interface

Winter Termination Program Digital Transformation

Digital transformation of New Jersey's utility shutoff protection self-certification process, replacing a manual PDF with an accessible electronic form.

Digital TransformationGovernment ServicesAccessibility
Operation Right Answer service modernization concept showing AI-assisted contact center technology

Operation Right Answer: Housing Services Modernization

Led development of a data-driven service framework to modernize how New Jersey housing programs serve residents, partnering with the New Jersey Innovation Authority to implement AI-assisted contact center technology.

Amazon ConnectGovernment InnovationCustomer Service
NJ Eviction Guide logo

NJ Eviction Guide

Interactive self-help tool connecting New Jersey's most vulnerable residents directly to housing assistance and legal resources.

Next.jsReactTypeScript
Bringing Veterans Home initiative logo

Bringing Veterans Home Digital Infrastructure

Statewide digital infrastructure supporting New Jersey's initiative to end veteran homelessness—data systems, electronic referrals, and public website development.

Government ServicesData InfrastructureWeb Development
Municipal Lead Reporting Portal interface showing the statewide compliance dashboard

Municipal Lead Reporting Portal

Statewide compliance platform capturing residential lead-paint inspection data for all 564 New Jersey municipalities.

Data CollectionComplianceGIS
Hexagonal map of New Jersey showing opioid overdose hotspots identified through spatial analysis

New Jersey Opioid Overdose Spatial Analysis

Grant-funded spatial analysis identifying opioid overdose hotspots across New Jersey using state administrative data and advanced geospatial methods, conducted as a Research Affiliate at Rutgers.

Spatial AnalysisGISPublic Health
3D visualization of New Jersey population density showing elevated ridges for densely populated urban areas like Newark and Jersey City

New Jersey Population Density Map

Award-winning 3D visualization of New Jersey population density using rayshader, winning First Place in the 3D category at the NJ DEP GIS Mapmaking Contest.

data visualizationRGIS
njgeo hex sticker logo

njgeo

An R package for geocoding addresses using New Jersey's official geocoding service, freely available as an alternative to commercial solutions.

Ropen sourcegeospatial
njtr1 hex sticker logo

njtr1

An R package that makes it easy to download and analyze New Jersey motor vehicle crash data for transportation research and safety analysis.

Ropen sourcetransportation
TrentonTracker: Making NJ Legislative Data Accessible

TrentonTracker

A modern Progressive Web App making New Jersey legislative data accessible and searchable, with ZIP code-based legislator lookup.

civic techtransparencyJavaScript
COVID-19 spread visualization map showing county-level case data across the United States

COVID-19 Spread Visualization

Interactive web-based visualization tracking the spread of COVID-19 across U.S. counties using advanced GIS technologies and real-time data processing.

GISdata visualizationpublic health

NJ Narcan Dashboard

An interactive dashboard tracking opioid overdose interventions across New Jersey through law enforcement Narcan deployment data.

public healthdata visualizationR
zipcodeR hex sticker logo

zipcodeR

An R package with 53+ peer-reviewed citations including Nature, Cancer Discovery, and MIS Quarterly, enabling breakthrough research in public health, epidemiology, and environmental science.

Ropen sourcegeospatial
OPRAmachine logo - New Jersey's statewide public records request platform

OPRAmachine

New Jersey's first statewide freedom of information platform, processing over 75,000 public records requests and releasing 250GB of government data.

civic techgovernment transparencypublic records

Publications

View all publications →

Articles & Tutorials

Frequently Asked Questions

What is zipcodeR and what can it do?

zipcodeR is an R package for working with U.S. ZIP code data, created by Gavin Rozzi. It provides functions for looking up ZIP code information, calculating distances between ZIP codes, finding ZIP codes within a radius, and accessing demographic data. The package has 115,000+ total downloads on CRAN.

What programming languages does Gavin Rozzi use for data science?

Gavin Rozzi primarily uses R for data science, including package development, statistical analysis, and data visualization with ggplot2. He also works with SQL, Python, and various GIS tools for geospatial analysis.

What is geospatial analysis and how is it used?

Geospatial analysis involves working with geographic data to understand spatial patterns and relationships. It includes GIS mapping, spatial statistics, boundary analysis, and geocoding. Applications include demographic research, public health analysis, and government resource allocation.

How can data science improve government decision-making?

Data science improves government decision-making by providing evidence-based insights that inform operational decisions. At NJ DCA, this means building systems that measure program effectiveness and track compliance outcomes—turning data into measurable impact for residents.

What makes Gavin Rozzi's approach to data science unique?

Gavin Rozzi combines technical data science skills with deep domain expertise in government and public policy. He creates open-source tools used by researchers nationwide, publishes peer-reviewed research, and applies data science to inform decision-making within state government—bridging academic rigor with implementation capacity.

About the Author

Gavin Rozzi

Gavin Rozzi

Gavin Rozzi is a civic technologist, data scientist, and digital transformation executive based in New Jersey. He leads technology initiatives at the NJ Department of Community Affairs and has created widely-used open-source tools including OPRAmachine and zipcodeR.