San Pedro I 410 F

506 Dolorosa St

San Antonio, TX


I am an Assistant Professor in the Department of Computer Science, College of Sciences, University of Texas at San Antonio. I am the founder and lead faculty of the Cohort for AI REsponsibility (CARE) at UTSA. I am also a core faculty member in the School of Data Science at UTSA.

Prior to joining UTSA, I was a postdoctoral researcher at the College of Information and Computer Science, University of Massachusetts, Amherst, where I was a member of the Data systems Research for Exploration, Analytics, and Modeling (DREAM) lab and of the Center for Data Science (CDS). I received a postdoctoral fellowship from the CDS at UMass.

I obtained my Ph.D. from New York University under the supervision of Prof. Julia Stoyanovich. I received a Pearl Brownstein Doctoral Research Award from the Tandon School of Engineering at NYU. Details of my Ph.D. research can be found at DataResponsibly.

Research Interests

My work is broadly in the areas of AI responsibility, data management, machine learning, and human-centered data science. In particular, I have focused on topics such as algorithmic fairness, diversity, transparency, and algorithmic accountability. Other areas of focus include AI and machine learning education and public engagement.

Professional Experience

Open Source Tools

  • Mirror Data Generator
    • A Python script that generates synthetic data mirroring issues such as sampling bias and societal bias. Each issue is described by the correlations between features.
  • Ranking Facts
    • A web-based tool that generates a "nutritional label" for rankings. Each label shows a fact about the ranking. For example, a fact about fairness explains whether the ranking shows statistical parity between groups that are defined by a user-specified feature.
  • FairDAGs
    • A web-based tool that extracts a directed acyclic graph (DAG) representation of a data science pipeline and tracks how each operation changes the distributions of targets and groups. The groups are often defined by a user-specified feature in the dataset.
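
To make the statistical parity fact mentioned under Ranking Facts more concrete, here is a minimal Python sketch of one way such a check could be computed. It is not the code used by Ranking Facts: the function names (`top_k_group_shares`, `max_parity_gap`), the top-k formulation, and the toy data are all assumptions made for illustration. The idea is to compare each group's share of the top-k positions with its share of the whole ranking; a large gap suggests the ranking departs from statistical parity for that group.

```python
from collections import Counter


def top_k_group_shares(ranking, groups, k):
    """Return {group: (share of top-k, share of full ranking)}.

    ranking: list of item ids, best first (hypothetical input format).
    groups:  dict mapping item id -> group label, taken from a
             user-specified feature.
    """
    if not ranking or k <= 0:
        raise ValueError("ranking must be non-empty and k must be positive")
    top = ranking[:k]
    overall = Counter(groups[item] for item in ranking)
    in_top = Counter(groups[item] for item in top)
    return {
        g: (in_top.get(g, 0) / len(top), overall[g] / len(ranking))
        for g in overall
    }


def max_parity_gap(shares):
    """Largest absolute difference between top-k share and overall share."""
    return max(abs(top - overall) for top, overall in shares.values())


# Toy ranking of eight items split evenly between two groups.
ranking = ["a", "b", "c", "d", "e", "f", "g", "h"]
groups = {"a": "X", "b": "X", "c": "X", "d": "Y",
          "e": "X", "f": "Y", "g": "Y", "h": "Y"}

shares = top_k_group_shares(ranking, groups, k=4)
gap = max_parity_gap(shares)
print(shares, gap)
```

In this toy example group X holds three of the top four positions while making up half of the ranking, so the gap is 0.25. Deciding what size of gap counts as a parity violation is a separate choice (a threshold or a statistical test) that this sketch does not make.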

Last Updated on 09/01/2023