About Me

Hi, I am Indraneel, a tech-first engineer building reliable data and AI/GenAI pipelines, interactive tools, and open-source systems that teams and communities can depend on. I work across R and Python, with a focus on clinical data programming, bioinformatics, cloud infrastructure, and developer experience.

My background spans pharma, bioinformatics, and product engineering. I have worked with complex biological and clinical datasets, clinical trial registries, single-cell multiomics, and regulated delivery environments. That mix shapes how I build: reproducible by default, practical for real teams, and clear enough for others to maintain.

Technical Expertise

In R, I build interactive web applications with Shiny, validated clinical programming systems, internal packages, and CRAN packages such as {llmshieldr}, {clintrialx}, and {shiny.ollama}. I work with the tidyverse, pharmaverse, testing, documentation, package development, and reproducible project environments.

In Python, I build data apps, ETL pipelines, dashboards, and LLM integrations using Streamlit, Pandas, Matplotlib, Plotly, Django, Flask, and Ollama. I also use AWS, Docker, Kubernetes, GitHub Actions, GitLab CI/CD, CircleCI, and Nextflow to ship scalable and reproducible systems.

I enjoy bringing people together to deliver high-impact projects: leading teams, mentoring engineers and clinical programmers, building internal tools, and helping organizations adopt open source, DevSecOps, and AI/GenAI practices with care.

Skills Overview

Category Skills
Programming Languages R, Python, Bash, SQL
R Ecosystem shiny, tidyverse, pharmaverse, seurat, signac, testthat, pkgdown, roxygen2, devtools, renv, clintrialx, llmshieldr
Python Ecosystem Streamlit, Pandas, Matplotlib, Plotly, Django, Flask, Ollama
Web Development HTML, CSS, Bootstrap, JavaScript
Operating Systems Linux (Ubuntu), Windows, Mac
Version Control Git, GitHub, GitLab, Bitbucket
Containerization and Orchestration Docker, Kubernetes (basic)
CI/CD GitHub Actions, CircleCI, self-hosted YAML workflows
Workflow Management Nextflow
Development and Collaboration Tools Atlassian suite (Bitbucket, Jira, Confluence, Trello), ClickUp, Notion
Cloud Services AWS (EC2, S3, EBS, EFS, ECR, IAM, DynamoDB, CodeDeploy, CodePipeline)
Coding Platforms VS Code, RStudio, Positron, Jupyter Notebooks
GenAI GitHub Copilot, Claude Code, Codex, Ollama local LLM API, OpenAI API, ChatGPT, Gemini, Grok, Perplexity
Productivity Suites Google Suite (Docs, Sheets, Slides, Sites, Analytics), Microsoft Office (Word, Excel, PowerPoint)

Work Experience

Ephicacy

Senior Technical Lead - Full time

November 2024 - Present

  • Led and mentored a 10-member engineering team, including 2 direct reports, and upskilled 60+ clinical programmers in advanced R and SDLC practices.
  • Championed GenAI integration and open-source strategy with senior leadership, supporting external partnerships and industry thought leadership.
  • Presented at PharmaSUG US 2026 in Boston on “Engineering Secure and Reproducible R-Based Clinical Programming Systems using Open Source DevSecOps Practices”; paper published in the PharmaSUG 2026 proceedings.
  • Delivered a technical session at IASCT ConSPIC India 2025 in Thiruvananthapuram on DevSecOps integration in clinical R programming. Reference: eConSPIC 2025 brochure, DS_PPT_011, page 64.
  • Built validated AWS R infrastructure with compliant audit trails and automated scaling, reducing provisioning time by 60%.
  • Architected enterprise-grade R packages using pharmaverse and tidyverse patterns for CDISC-compliant SDTM/ADaM workflows, achieving 30% code reuse.
  • Engineered Docker and GitHub Actions CI/CD pipelines, cutting manual error rates by 50%, supporting GxP compliance, and enabling regulated enterprise client work.
  • Spearheaded SAS-to-R migration by refactoring legacy macros into modular, testable functions and delivering 40% faster pipeline processing.

Lama Data and Agilisium Consulting

Senior Software Engineer - Contract

November 2023 - October 2024

  • Developed and maintained in-house R packages for data pipelines, improving modularity and debugging.
  • Extracted and transformed data from DailyMed XML documents into tabular formats using Python.
  • Developed and enhanced R Shiny dashboards for data visualization and reporting.
  • Improved ETL processes with R, Python, and Docker on AWS, achieving a 30% reduction in bioinformatics data processing time.
  • Optimized SQL queries on Amazon RDS, improving performance and reducing retrieval latency.
  • Automated workflows with cron jobs and GitLab CI/CD, increasing operational efficiency and reducing manual errors.

Appsilon

R/Shiny Developer Consultant - B2B Contract

March 2023 - November 2023

  • Developed and deployed commercial data dashboards using R Shiny, improving development and user experience through CI/CD and agile practices on Posit cloud platforms.
  • Integrated Snowflake data sources in ETL pipelines to streamline data processing and improve dashboard performance.
  • Built a multi-user chatbot in R Shiny using OpenAI’s GPT API, enabling users to interrogate internal data.
  • Improved internal communication by setting up automated Slack alerts and bots.
  • Led cross-functional discovery sessions with marketing and sales in life sciences.
  • Managed an open-source community of 500+ members, providing technical support and contributing to market research.
  • Authored an official Shiny.fluent tutorial on building apps with Rhino and Shiny.fluent: Build Apps with Rhino and shiny.fluent.
  • Authored a blog on packages for clinical trial data.

Elucidata

Senior Bioinformatics Engineer - Solutions Engineering and Compute - Full time

June 2021 - February 2023

  • Delivered dockerized solutions and maintained CI/CD pipelines to automate deployments on cloud platforms.
  • Provided support and observability with Prometheus for Kubernetes clusters on AWS across production, development, and test platforms.
  • Shipped product features to run data workflows with optimized resources using Nextflow, achieving 75% less completion time.
  • Built an R Shiny web application enabling biomedical data analysis and visualization for B2B SaaS products.
  • Provided Tier 2 and Tier 3 technical support, contributing to increased monthly active users, revenue, and Series A funding readiness.
  • Led small teams on product delivery, technical mentorship, and industry-relevant coding practices.

Institute of Bioinformatics and Applied Biotechnology, Bengaluru

Research Assistant - Analysis and Management of Clinical Trial Data for Policy Insights - Full time

March 2020 - May 2021

  • Worked with Dr. Gayatri Saberwal in the Policy Research Team.
  • Performed audits using publicly available data from FDA, ClinicalTrials.gov, and Clinical Trials Registry - India.
  • Automated data mining and web scraping using Python and R, creating up-to-date SQLite-backed ETL pipelines.
  • Created dashboards with Streamlit for team communication and productivity.
  • Reduced manual effort by 70% and contributed to multiple scientific publications.

Chegg Inc.

Managed Network Expert - Freelance

September 2018 - March 2020

  • Active member of the Chegg global student community.
  • Mentored students worldwide by answering questions in biological sciences.

Talks and Presentations

  • R/Pharma GenAI Day 2026 - “‘Ignore All Previous Instructions’ and Other Things Your LLM Shouldn’t Do in Pharma.” Presented {llmshieldr}, an R package for LLM safety guardrails across prompts and outputs in R. Recording. Artifacts.
  • PharmaSUG US 2026, Boston - “Engineering Secure and Reproducible R-Based Clinical Programming Systems using Open Source DevSecOps Practices.” Paper.
  • IASCT ConSPIC India 2025, Thiruvananthapuram - “Integrating DevSecOps Principles into R for Clinical Data Programming: Enhancing Compliance, Reproducibility, and Efficiency.” Reference, page 64, DS_PPT_011.

Open Source Packages and Projects

  • {llmshieldr} - Safety guardrails for Large Language Model workflows in pharma, covering prompt safety, output filtering, and compliance-aware LLM usage. First published on CRAN in May 2026 and reached 500+ downloads as of June 2026. CRAN.
  • {clintrialx} - R package to fetch and explore clinical trials data from ClinicalTrials.gov and CTTI AACT registries. First published on CRAN in September 2024 and reached 5000+ downloads as of June 2026. Also cited as a relevant tool in a Cambridge University Press publication. CRAN. DOI.
  • 30 Days of Pharmaverse - Open-source, self-paced guide to clinical data science with R using the pharmaverse ecosystem. Reached 4,537+ visits as of June 2026. Website. GitHub.
  • Streamlit at Snowflake - Creator and open-source contributor, building custom features, apps, and Python packages including StartLit to make the Streamlit and Snowflake journey easier.
  • The Carpentries - Lesson maintainer for the Python Novice Inflammation lesson, helping keep lessons accurate, functional, cohesive, and contributor-friendly.

Technical Reviewing

I volunteer as a technical reviewer for Packt Publishing, reviewing technical content, fixing code syntax issues, reporting missing data sources, identifying unreliable implementations, and publishing supporting code for the open-source community.

  • Data-Centric Machine Learning with Python - ISBN: 9781804618127. Published February 2024. Book.
  • A Handbook of Mathematical Models with Python - ISBN: 9781804616703. Published August 2023. Book.
  • Applied Machine Learning for Healthcare and Life Sciences Using AWS - ISBN: 9781804610213. Published November 2022. Book.
  • Machine Learning in Biotechnology and Life Sciences - ISBN: 9781801811910. Published January 2022. Book.

Education

Pondicherry University, Centre for Bioinformatics

Master of Science in Bioinformatics

2017 - 2019

CGPA: 8.51/10

Maulana Abul Kalam Azad University of Technology, West Bengal

Bachelor of Science (H) Biotechnology

2014 - 2017

CGPA: 8.46/10 | Bronze Medalist

Certifications

  • AWS Cloud Technology Consultant - Amazon Web Services, 2026. Credential.
  • Hands On Clinical Reporting with R - Genentech, a member of the Roche Group, 2025. Credential.

Research Publications

  • Ethnic representation in interventional clinical trials run in India - The Lancet Regional Health, June 2023. Article.
  • Facilitating audits of clinical trial data using documents of the Food and Drug Administration - The Journal of Scientific Practice and Integrity, December 2022. Article.
  • An analysis of deficiencies in the ethics committee data of certain interventional trials registered with the Clinical Trials Registry - India - PLOS Global Public Health, October 2022. Article.
  • Rare disease patients in India are rarely involved in international orphan drug trials - PLOS Global Public Health, August 2022. Article.
  • CTRI requirement of prospective trial registration: Not always consistent - Indian Journal of Medical Ethics, May 2022. Article.
  • Discrepancies between FDA documents and ClinicalTrials.gov for Orphan Drug-related clinical trial data - PLOS Global Public Health, April 2022. Article.

If you are building scalable systems, clinical data pipelines, AI-enabled tools, or open-source developer workflows, feel free to reach me at hello.indraneel@gmail.com.