Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Pages

My CV, publications, blog, etc

Posts

Using gnomAD

1 minute read

Published:

This post is a very short learning point on how to identify pathogenic variants in a given gene. I have not yet verified whether this resource is reputable, however I hope that it provides to be a great resource. It was previously used by Joe Park in another area of the PMBB lab, an utilized in his paper in Nature here.

Beginning of a quick project

4 minute read

Published:

Below is a post to identify the first few steps in a project in which I aim to identify the proportion of pathogenic variants present in our population.

Creating a PRS based on a list of SNPs and the imputed PMBB dataset

13 minute read

Published:

This post shows the script I wrote to convert LD pruned lists of SNPs, apply them to the Penn Medicine Biobank on a specific population, and use plink -score to create Polygenic Risk scores. Below I have pasted the .sh script I wrote today to create PRS.

Performance of polygenic risk scores for cancer prediction in a racially diverse academic biobank: Methods part 2

8 minute read

Published:

This blog post is Part 2 of outlining the materials and methods used in our lab’s most recent work on generating polygenic risk scores (PRSs). This article is entitled “Performance of polygenic risk scores for cancer prediction in a racially diverse academic biobank”. It was published in October 2021 and led by Dr. Louise Wang and Heena Desai.

Genotype Imputation and TOPMed r2

5 minute read

Published:

I have never understood genotype imputation. This blog post serves to describe the concept of genetic imputation and how it’s carried out using TOPMed.

Performance of polygenic risk scores for cancer prediction in a racially diverse academic biobank: Methods Part 1

4 minute read

Published:

This blog post will be Part 1 of outlining the materials and methods used in our lab’s most recent work on generating polygenic risk scores (PRSs). This article is entitled “Performance of polygenic risk scores for cancer prediction in a racially diverse academic biobank”. It was published in October 2021 and led by Dr. Louise Wang and Heena Desai.

What is a VCF file?

2 minute read

Published:

This blog post exists because I tried to create a blog post on study methods and failed.

Initial exploration of Penn Medicine Biobank

10 minute read

Published:

Today I’m exploring the data available within PMBB. This blog is a data dump of the directories and README files, which might not be pleasant to read. However, I’m interested in copying it here as a resource that I can check back on when navigating this complicated database.

An introduction to the Penn Medicine Biobank

2 minute read

Published:

Welcome to the blog. In this first posting, I aim to describe the Penn Medicine Biobank. Although it remains an unfamiliar resource within Penn Medicine, it contains a fantastic array of electronic health record and genetic information with tremendous research potential.

portfolio

publications

talks

teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.