Samuel Bunga

samuelbunga.github.io

bunga.s@northeastern.edu

samuelbunga

samuelbunga

thebluenerve

Boston, Massachusetts 02130 USA

857-701-9190

Interests

Single Cell Genomics , Network inference , Bioinformatics software development

Skills

Programming/Development: Python, JavaScript, R, Shell Scripting, Nextflow, PHP, CI/CD, Git, AWS

Education

Northeastern University, Boston, MA

M.S., Bioinformatics

  • Computational Methods – 1 and 2, Bioinformatics Programming, Machine Learning, Data Analytics

August 2018 - May 2020

Work

Bioinformatics Scientist 2,

Illumina, Inc., San Diego, CA

  • Worked on contributing code and developing Apps like GISAID uploader, Microbiome Metagenomics, Microbial Enrichment Analysis, DRAGEN COVID Lineage, DRAGEN Viral Lineage and DRAGEN Targeted Microbial
  • Developed various Python modules which are being used internally in different DRAGEN Apps

February 2022 - Present

Bioinformatics Software Developer, Harvard Program in Therapeutic Science (HiTS)

Harvard Medical School, Boston, MA

  • Developed Neuro Immune interactome pipeline using Nextflow and Python to study the interactions in different mouse pain models between Immune cell ligands and Neuron receptors using INDRA network and cellphonedb (github.com/indralab/panacea_indra)
  • Designed and developed a workflow using COSMOS to integrate and connect data from different omic (Phospho-proteomics and Transcriptomics) datasets to make network level causal reasoning in CFA and Naïve sciatic nerve samples (github.com/samuelbunga/panacea_indra/tree/master/proteomics/cosmos/scripts)
  • Created workflows for pre-processing in-Drops and 10X single-cell RNA-Seq raw samples using bcbio pipelines and cellranger respectively
  • Contributed code and maintained bulk and single-cell RNA-Seq pipelines for the bioinformatics core team at Boston Childrens Hospital
  • • Contributed code in fixing bugs and making significant improvements to CXP-lite (Profiling neuronal activity from fluorescence imaging data) tool which potentially saved hours of work for researchers at Woolf lab

August 2020 - February 2022

Graduate Teaching Assistant, Computational Methods-2

Northeastern University, Boston, MA

  • Co-supervise and conduct lab sessions for 30 students and held office hours every week
  • Provide comprehensive academic support and mentored students on individual basis
  • Conduct and evaluate student quizzes and assignments

September 2019 - April 2020

Research Intern, Bhasin Lab

Harvard Medical School, Boston, MA

  • Designed and developed a free to use web application (Single Cell Wizard) for processing 10X, CelSeq or DropSeq Single Cell samples
  • Developed home brew tools for Single Cell Wizard pipeline like FASTQ demultiplexer and FASTQ cleaner to remove singleton nucleotides and low-quality reads from each sample
  • Assembled and created a Database for Gene expression markers that define cell types which can be used to annotate clusters of single cells via Gene Set Enrichment Analysis

May 2019 - December 2019

Project Experience

Single Cell Wizard, Bhasin Lab

bhasinlab.us/scw_page/scwiz.php

Single Cell RNA-seq pre/post processing pipeline

  • Single Cell Wizard is a comprehensive Single Cell RNA-Seq processing web-application which is a totally automated pipeline that can take raw FASTQ files, process them to generate feature barcode matrices and perform pre-processing, data normalization, quality control to differentially expressed genes, annotation of clusters and final visualization

August 2019 - December 2019

Indian Ayurvedic Bioflavonoids Database, Northeastern University

  • Compiled over 100 plant metabolites in Ayurvedic formulations and forming up an interactome which could serve as a lead for potential drug targets that can be taken up for clinical trials

January 2019 - March 2019

References

Available upon request