Pranava Upparlapalli

Pranava Upparlapalli

As a bioinformatics scientist (M.S.), I thrive on partnering with research teams to solve their most pressing data challenges. I build effective, automated pipelines using Python, R, and Nextflow to support and advance their data-driven scientific goals.

"Bridging biology and data with clarity and purpose."

Areas of Expertise

NGS Icon

Next-Generation Sequencing (NGS) Data Processing

Develop end-to-end pipelines for RNA-Seq, DNA-Seq, and targeted sequencing to transform raw reads into high-confidence, interpretable biological data.

Genomics Icon

Genomic Data Analysis and Interpretation

Analyze large-scale genomic datasets to interpret gene regulation, assess variant effects, and uncover clinically meaningful trait associations.

Data Science Icon

Bioinformatics and Statistical Data Analysis

Apply rigorous statistical methods and bioinformatics tools to generate accurate, reproducible insights from complex omics datasets.

Machine Learning Icon

Machine Learning Applications in Computational Biology

Design and deploy ML models for classification, prediction, and dimensionality reduction, adapted to the intricacies of biological data.

GWAS-TWAS Icon

GWAS and TWAS Pipeline Development

Build integrative workflows combining genotype, expression, and chromatin interaction data using PLINK, PrediXcan, and MetaXcan for trait-gene mapping.

Pipeline Automation Icon

Workflow Automation and Scalable Pipeline Design

Develop modular, version-controlled pipelines using Snakemake, Nextflow, and shell scripting to automate and scale bioinformatics workflows.

Work Experience

Graduate Researcher – TWAS & Genomic Modeling

Dr. Xuan’s Lab, University of Texas at Dallas (Jan 2025 – Present)

Developed and tested a GWAS-driven machine learning pipeline using PrediXcan to predict tissue-specific gene expression, leveraging eQTL models and the GTEx database.

Analyzed trans-acting SNPs using Hi-C data to explore long-range chromatin interactions and their impact on genotype–phenotype associations.

Built and optimized reproducible data analysis pipelines using Bash scripting and executed workflows on high-performance computing (HPC) systems.

Undergraduate Researcher – Biochemical Assay Development

Sree Vidyanikethan Degree College, Tirupati (Aug 2020 – Mar 2021)

Investigated and characterized the antioxidant and antibacterial properties of Biancaea Sappan using advanced biochemical assays.

Formulated specialized media and optimized growth conditions to improve compound yield and bacterial inhibition profiles.

Performed lab experiments that enhanced extraction efficiency by 15%, and tested antibacterial activity across varying extract concentrations.

Projects

SeqMorph: Mutation Analysis Tool

A Python-based tool to analyze and visualize DNA/protein mutations with support for functional impact prediction.

Python · Biopython · Pandas
GitHub

Cancer RNA-Seq Expression Analysis

Analyzed gene expression across five cancer types using DESeq2, PCA, and clustering to identify expression patterns and biomarkers.

R · DESeq2 · ggplot2 · pheatmap · PCA · Clustering
GitHub

Nociception Study using Secondary Metabolites

Studied nociceptive effects of gut microbiota–derived secondary metabolites, linking microbial activity to pain signaling pathways.

AntiSMASH · PCR · Microbial techniques
GitHub

Yeast Stress RNA-Seq Pipeline

Nextflow-based pipeline for differential gene expression analysis under oxidative stress in yeast.

Nextflow · DESeq2 · Docker · MultiQC
GitHub

Gleason Score Classification (DL)

ResNet50-based classifier trained on histopathological images to predict prostate cancer severity.

TensorFlow · Keras · Python
GitHub

DDSEQ2 RNA-Seq Analysis

Differential gene expression pipeline in R using DESeq2 with custom visualization outputs.

R · DESeq2 · ggplot2
GitHub

Skills

Python
R
Shell
PySpark
Git
Nextflow
Snakemake
Docker
GitHub
FastQC
DESeq2
edgeR

Bioinformatics Tools

  • FeatureCounts
  • GSEA
  • MetaXcan
  • PLINK
  • Trimmomatic

Programming

  • Python
  • R
  • Bash / Shell
  • SQL
  • PySpark

Workflow & Infra

  • Snakemake
  • Nextflow
  • Git
  • Docker
  • AWS

Databases

  • NCBI
  • GEO
  • Ensembl
  • UCSC Genome Browser
  • KEGG

Wet Lab Skills

  • DNA/RNA Extraction
  • qPCR
  • Western Blotting
  • Gel Electrophoresis
  • MIC Assay
  • ELISA
  • Antibacterial Testing
  • Protein Docking

Soft Skills

  • Scientific Communication
  • Problem Solving
  • Reproducibility Practices
  • Time Management
  • Attention to Detail
  • Team Collaboration

Education

MS: Bioinformatics and Computational Biology

University of Texas at Dallas (UTD) — Expected May 2025
GPA: 3.20 / 4.00
Relevant Coursework:
• Applied Bioinformatics
• Probability, Statistics, and Data Science in Bioinformatics
• Molecular Biology
• Algorithms and Data Structures
• Combinatorics and Graph Theory
• Medical Image Analysis
• Introduction to Big Data Analytics

Advanced Diploma: Bioinformatics

Bharati Vidyapeeth University (BVDU) — 2022
GPA: 9.14/10
Relevant Coursework:
• Biological Informatics
• Biostatistics
• Data Mining through Machine Learning
• Advanced Bioinformatics
• Science of Omics
• Molecular Modeling & Drug Designing
• R & Data Analytics
• Java & BioJava Programming

BSc: Microbiology, Biochemistry, Chemistry

Sri Venkateswara University (SVU) — May 2020
GPA: 8.20/10
Relevant Coursework:
• Microbial Physiology
• Medical Microbiology
• Immunology
• Biomolecules
• Biotechnology
• Molecular Biology
• Biochemistry
• Organic and Inorganic Chemistry