About Me
Hi, I am Indraneel, a tech-first engineer building reliable data and AI/GenAI pipelines, interactive tools, and open-source systems that teams and communities can depend on. I work across R and Python, with a focus on clinical data programming, bioinformatics, cloud infrastructure, and developer experience.
My background spans pharma, bioinformatics, and product engineering. I have worked with complex biological and clinical datasets, clinical trial registries, single-cell multiomics, and regulated delivery environments. That mix shapes how I build: reproducible by default, practical for real teams, and clear enough for others to maintain.
Technical Expertise
In R, I build interactive web applications with Shiny, validated clinical programming systems, internal packages, and CRAN packages such as {llmshieldr}, {clintrialx}, and {shiny.ollama}. I work with the tidyverse, pharmaverse, testing, documentation, package development, and reproducible project environments.
In Python, I build data apps, ETL pipelines, dashboards, and LLM integrations using Streamlit, Pandas, Matplotlib, Plotly, Django, Flask, and Ollama. I also use AWS, Docker, Kubernetes, GitHub Actions, GitLab CI/CD, CircleCI, and Nextflow to ship scalable and reproducible systems.
I enjoy bringing people together to deliver high-impact projects: leading teams, mentoring engineers and clinical programmers, building internal tools, and helping organizations adopt open source, DevSecOps, and AI/GenAI practices with care.
Skills Overview
| Category | Skills |
|---|---|
| Programming Languages | R, Python, Bash, SQL |
| R Ecosystem | shiny, tidyverse, pharmaverse, seurat, signac, testthat, pkgdown, roxygen2, devtools, renv, clintrialx, llmshieldr |
| Python Ecosystem | Streamlit, Pandas, Matplotlib, Plotly, Django, Flask, Ollama |
| Web Development | HTML, CSS, Bootstrap, JavaScript |
| Operating Systems | Linux (Ubuntu), Windows, Mac |
| Version Control | Git, GitHub, GitLab, Bitbucket |
| Containerization and Orchestration | Docker, Kubernetes (basic) |
| CI/CD | GitHub Actions, CircleCI, self-hosted YAML workflows |
| Workflow Management | Nextflow |
| Development and Collaboration Tools | Atlassian suite (Bitbucket, Jira, Confluence, Trello), ClickUp, Notion |
| Cloud Services | AWS (EC2, S3, EBS, EFS, ECR, IAM, DynamoDB, CodeDeploy, CodePipeline) |
| Coding Platforms | VS Code, RStudio, Positron, Jupyter Notebooks |
| GenAI | GitHub Copilot, Claude Code, Codex, Ollama local LLM API, OpenAI API, ChatGPT, Gemini, Grok, Perplexity |
| Productivity Suites | Google Suite (Docs, Sheets, Slides, Sites, Analytics), Microsoft Office (Word, Excel, PowerPoint) |
Work Experience
Ephicacy
Senior Technical Lead - Full time
November 2024 - Present
- Led and mentored a 10-member engineering team, including 2 direct reports, and upskilled 60+ clinical programmers in advanced R and SDLC practices.
- Championed GenAI integration and open-source strategy with senior leadership, supporting external partnerships and industry thought leadership.
- Presented at PharmaSUG US 2026 in Boston on “Engineering Secure and Reproducible R-Based Clinical Programming Systems using Open Source DevSecOps Practices”; paper published in the PharmaSUG 2026 proceedings.
- Delivered a technical session at IASCT ConSPIC India 2025 in Thiruvananthapuram on DevSecOps integration in clinical R programming. Reference: eConSPIC 2025 brochure, DS_PPT_011, page 64.
- Built validated AWS R infrastructure with compliant audit trails and automated scaling, reducing provisioning time by 60%.
- Architected enterprise-grade R packages using pharmaverse and tidyverse patterns for CDISC-compliant SDTM/ADaM workflows, achieving 30% code reuse.
- Engineered Docker and GitHub Actions CI/CD pipelines, cutting manual error rates by 50%, supporting GxP compliance, and enabling regulated enterprise client work.
- Spearheaded SAS-to-R migration by refactoring legacy macros into modular, testable functions and delivering 40% faster pipeline processing.
Lama Data and Agilisium Consulting
Senior Software Engineer - Contract
November 2023 - October 2024
- Developed and maintained in-house R packages for data pipelines, improving modularity and debugging.
- Extracted and transformed data from DailyMed XML documents into tabular formats using Python.
- Developed and enhanced R Shiny dashboards for data visualization and reporting.
- Improved ETL processes with R, Python, and Docker on AWS, achieving a 30% reduction in bioinformatics data processing time.
- Optimized SQL queries on Amazon RDS, improving performance and reducing retrieval latency.
- Automated workflows with cron jobs and GitLab CI/CD, increasing operational efficiency and reducing manual errors.
Appsilon
R/Shiny Developer Consultant - B2B Contract
March 2023 - November 2023
- Developed and deployed commercial data dashboards using R Shiny, improving development and user experience through CI/CD and agile practices on Posit cloud platforms.
- Integrated Snowflake data sources in ETL pipelines to streamline data processing and improve dashboard performance.
- Built a multi-user chatbot in R Shiny using OpenAI’s GPT API, enabling users to interrogate internal data.
- Improved internal communication by setting up automated Slack alerts and bots.
- Led cross-functional discovery sessions with marketing and sales in life sciences.
- Managed an open-source community of 500+ members, providing technical support and contributing to market research.
- Authored an official Shiny.fluent tutorial on building apps with Rhino and Shiny.fluent: Build Apps with Rhino and shiny.fluent.
- Authored a blog on packages for clinical trial data.
Elucidata
Senior Bioinformatics Engineer - Solutions Engineering and Compute - Full time
June 2021 - February 2023
- Delivered dockerized solutions and maintained CI/CD pipelines to automate deployments on cloud platforms.
- Provided support and observability with Prometheus for Kubernetes clusters on AWS across production, development, and test platforms.
- Shipped product features to run data workflows with optimized resources using Nextflow, achieving 75% less completion time.
- Built an R Shiny web application enabling biomedical data analysis and visualization for B2B SaaS products.
- Provided Tier 2 and Tier 3 technical support, contributing to increased monthly active users, revenue, and Series A funding readiness.
- Led small teams on product delivery, technical mentorship, and industry-relevant coding practices.
Institute of Bioinformatics and Applied Biotechnology, Bengaluru
Research Assistant - Analysis and Management of Clinical Trial Data for Policy Insights - Full time
March 2020 - May 2021
- Worked with Dr. Gayatri Saberwal in the Policy Research Team.
- Performed audits using publicly available data from FDA, ClinicalTrials.gov, and Clinical Trials Registry - India.
- Automated data mining and web scraping using Python and R, creating up-to-date SQLite-backed ETL pipelines.
- Created dashboards with Streamlit for team communication and productivity.
- Reduced manual effort by 70% and contributed to multiple scientific publications.
Chegg Inc.
Managed Network Expert - Freelance
September 2018 - March 2020
- Active member of the Chegg global student community.
- Mentored students worldwide by answering questions in biological sciences.
Talks and Presentations
- R/Pharma GenAI Day 2026 - “‘Ignore All Previous Instructions’ and Other Things Your LLM Shouldn’t Do in Pharma.” Presented
{llmshieldr}, an R package for LLM safety guardrails across prompts and outputs in R. Recording. Artifacts. - PharmaSUG US 2026, Boston - “Engineering Secure and Reproducible R-Based Clinical Programming Systems using Open Source DevSecOps Practices.” Paper.
- IASCT ConSPIC India 2025, Thiruvananthapuram - “Integrating DevSecOps Principles into R for Clinical Data Programming: Enhancing Compliance, Reproducibility, and Efficiency.” Reference, page 64, DS_PPT_011.
Open Source Packages and Projects
- {llmshieldr} - Safety guardrails for Large Language Model workflows in pharma, covering prompt safety, output filtering, and compliance-aware LLM usage. First published on CRAN in May 2026 and reached 500+ downloads as of June 2026. CRAN.
- {clintrialx} - R package to fetch and explore clinical trials data from ClinicalTrials.gov and CTTI AACT registries. First published on CRAN in September 2024 and reached 5000+ downloads as of June 2026. Also cited as a relevant tool in a Cambridge University Press publication. CRAN. DOI.
- 30 Days of Pharmaverse - Open-source, self-paced guide to clinical data science with R using the pharmaverse ecosystem. Reached 4,537+ visits as of June 2026. Website. GitHub.
- Streamlit at Snowflake - Creator and open-source contributor, building custom features, apps, and Python packages including
StartLitto make the Streamlit and Snowflake journey easier. - The Carpentries - Lesson maintainer for the Python Novice Inflammation lesson, helping keep lessons accurate, functional, cohesive, and contributor-friendly.
Technical Reviewing
I volunteer as a technical reviewer for Packt Publishing, reviewing technical content, fixing code syntax issues, reporting missing data sources, identifying unreliable implementations, and publishing supporting code for the open-source community.
- Data-Centric Machine Learning with Python - ISBN: 9781804618127. Published February 2024. Book.
- A Handbook of Mathematical Models with Python - ISBN: 9781804616703. Published August 2023. Book.
- Applied Machine Learning for Healthcare and Life Sciences Using AWS - ISBN: 9781804610213. Published November 2022. Book.
- Machine Learning in Biotechnology and Life Sciences - ISBN: 9781801811910. Published January 2022. Book.
Education
Pondicherry University, Centre for Bioinformatics
Master of Science in Bioinformatics
2017 - 2019
CGPA: 8.51/10
Maulana Abul Kalam Azad University of Technology, West Bengal
Bachelor of Science (H) Biotechnology
2014 - 2017
CGPA: 8.46/10 | Bronze Medalist
Certifications
- AWS Cloud Technology Consultant - Amazon Web Services, 2026. Credential.
- Hands On Clinical Reporting with R - Genentech, a member of the Roche Group, 2025. Credential.
Research Publications
- Ethnic representation in interventional clinical trials run in India - The Lancet Regional Health, June 2023. Article.
- Facilitating audits of clinical trial data using documents of the Food and Drug Administration - The Journal of Scientific Practice and Integrity, December 2022. Article.
- An analysis of deficiencies in the ethics committee data of certain interventional trials registered with the Clinical Trials Registry - India - PLOS Global Public Health, October 2022. Article.
- Rare disease patients in India are rarely involved in international orphan drug trials - PLOS Global Public Health, August 2022. Article.
- CTRI requirement of prospective trial registration: Not always consistent - Indian Journal of Medical Ethics, May 2022. Article.
- Discrepancies between FDA documents and ClinicalTrials.gov for Orphan Drug-related clinical trial data - PLOS Global Public Health, April 2022. Article.
If you are building scalable systems, clinical data pipelines, AI-enabled tools, or open-source developer workflows, feel free to reach me at hello.indraneel@gmail.com.