Rishiraj Sinharay

I'm a Data

About

I'm a Data Science professional with a strong foundation in Python, NLP, GenAI, and dashboarding, backed by a Master’s from Monash University. With experience spanning data engineering, applied machine learning, and stakeholder communication, I build end-to-end solutions that bridge technical insight with strategic impact.

My portfolio features advanced projects in AI-driven market intelligence, biomedical analytics, and real-time dashboards using Streamlit and LLMs like LLaMA. Having worked across tech and customer-facing roles, I bring a rare blend of technical depth and people-centric thinking to every project I undertake.

Resume

Education

Master of Data Science

2022-2023

Monash Univeristy, Melbourne, Australia

Bachelor of Technology, Electrical & Electronics Engineering

2016 - 2020

Manipal Institute of Technology, Manipal, India

Professional Experience

Data Science Intern

February 2025 - June 2025

Dell Technologies

  • End-to-End Data Pipeline Development: Automated the extraction and integration of 1,500+ job listings and 300+ news articles using Selenium, BeautifulSoup, SerpAPI, and PyPDF2, covering 7 leading IT firms across India and Australia.
  • Advanced NLP & Text Processing: Built scalable preprocessing pipelines using NLTK and Hugging Face Transformers to clean and analyze unstructured job and article content, enabling accurate topic modeling and trend analysis.
  • LLM Integration for Strategic Insight: Utilized the LLaMA-3.3–70B model with FAISS-based semantic retrieval to generate contextual insights on market trends, competitor positioning, and partnership opportunities through retrieval-augmented generation (RAG).
  • Interactive Dashboarding & Visualization: Developed a Streamlit-based GenAI intelligence dashboard, allowing users to filter by company, region, and analysis type and generate executive summaries with actionable recommendations.
  • Strategic Impact & Insight Delivery: Delivered data-driven reports highlighting high-synergy sectors (e.g., Manufacturing, Finance, HR) and competitive advantages, supporting strategic decision-making at the leadership level.

Associate Professional Software Engineer

July 2020 - January 2022

DXC Technology

  • Data Pipeline Monitoring & Optimization: Supported and monitored ETL workflows in SAP BODS, ensuring 99.9% job success rate and reducing daily batch failures by 20% through proactive issue resolution.
  • Big Data Platform Operations: Provided L1 support for Apache Hadoop clusters across distributed environments, managing over 50+ nodes to maintain continuous data availability and performance.
  • System Performance Enhancement: Tuned Hadoop jobs and resource configurations, improving job throughput by 25% and reducing average data processing latency during peak hours.
  • Infrastructure Enhancement & Automation: Implemented enterprise software solutions using SAP BODS, reducing data retrieval times by 20% and significantly improving system performance.
  • Cloud Platform Familiarity: Applied foundational knowledge of Microsoft Azure (AZ-900) to assist in data movement and cloud-based storage integration for analytics projects.

Portfolio

Explore my portfolio of data-driven projects, where I apply advanced analytics, machine learning, and visualization techniques to extract meaningful insights. From predictive modeling to real-world problem-solving, each project highlights my expertise in SQL, Python, and data science, showcasing my ability to turn raw data into actionable solutions.

  • All
  • Sports
  • Coursework
  • GenAI
  • Others

Master of Data Science, Minor Thesis

A Novel Machine Learning Framework for Identifying Predictive Biomarkers of FGFR Targeted Therapy in Breast Cancer

News Summarization and Text-to-Speech Application

Akaike Internship Assignment

Master of Data Science, Data Visualization Project

Spotify Streaming Data Analysis

GenAI Market Intelligence Dashboard

A data-driven platform for tracking Generative AI initiatives, competitor analysis, and partnership opportunities for Dell

Bayer 04 Leverkusen’s Unbeaten Bundesliga Season

A data-driven analysis of Bayer 04 Leverkusen's unbeaten Bundesliga season under Xabi Alonso

Leicester City’s 2015/16 Title Winning Premier League Season

The 5000-to-1 Miracle: How Leicester City Won the Premier League. A Data-Driven Tribute to Jamie Vardy and Co.

Inside the Numbers: How Aitana Bonmatí Dominated the 2023 FIFA World Cup

A data-driven analysis of her performance leading to Spain's victory in the 2023 FIFA World Cup and her well deserved Golden Ball award.

Certifications

Microsoft Azure AI Essentials Professional Certificate by Microsoft and LinkedIn, 2025

Microsoft Azure Fundamentals, 2021

ITIL Foundation: ITIL 4 Edition

Introduction to Football Analytics - Hudl Statsbomb, 2025

Power BI, Internshala, 2025

Databricks Academic Accreditation - Generative AI Fundamentals

Contact

Let's connect! Feel free to reach out for collaborations, inquiries, or just to say hello.

Address

Kolkata, West Bengal, 700052

Email

rishiraj1998.rs@gmail.com