
Rishiraj Sinharay
I'm a Data
About

I'm a Data Science professional with a strong foundation in Python, NLP, GenAI, and dashboarding, backed by a Master’s from Monash University. With experience spanning data engineering, applied machine learning, and stakeholder communication, I build end-to-end solutions that bridge technical insight with strategic impact.
My portfolio features advanced projects in AI-driven market intelligence, biomedical analytics, and real-time dashboards using Streamlit and LLMs like LLaMA. Having worked across tech and customer-facing roles, I bring a rare blend of technical depth and people-centric thinking to every project I undertake.
Resume
Education
Master of Data Science
2022-2023
Monash Univeristy, Melbourne, Australia
Bachelor of Technology, Electrical & Electronics Engineering
2016 - 2020
Manipal Institute of Technology, Manipal, India
Professional Experience
Data Science Intern
February 2025 - June 2025
Dell Technologies
- End-to-End Data Pipeline Development: Automated the extraction and integration of 1,500+ job listings and 300+ news articles using Selenium, BeautifulSoup, SerpAPI, and PyPDF2, covering 7 leading IT firms across India and Australia.
- Advanced NLP & Text Processing: Built scalable preprocessing pipelines using NLTK and Hugging Face Transformers to clean and analyze unstructured job and article content, enabling accurate topic modeling and trend analysis.
- LLM Integration for Strategic Insight: Utilized the LLaMA-3.3–70B model with FAISS-based semantic retrieval to generate contextual insights on market trends, competitor positioning, and partnership opportunities through retrieval-augmented generation (RAG).
- Interactive Dashboarding & Visualization: Developed a Streamlit-based GenAI intelligence dashboard, allowing users to filter by company, region, and analysis type and generate executive summaries with actionable recommendations.
- Strategic Impact & Insight Delivery: Delivered data-driven reports highlighting high-synergy sectors (e.g., Manufacturing, Finance, HR) and competitive advantages, supporting strategic decision-making at the leadership level.
Associate Professional Software Engineer
July 2020 - January 2022
DXC Technology
- Data Pipeline Monitoring & Optimization: Supported and monitored ETL workflows in SAP BODS, ensuring 99.9% job success rate and reducing daily batch failures by 20% through proactive issue resolution.
- Big Data Platform Operations: Provided L1 support for Apache Hadoop clusters across distributed environments, managing over 50+ nodes to maintain continuous data availability and performance.
- System Performance Enhancement: Tuned Hadoop jobs and resource configurations, improving job throughput by 25% and reducing average data processing latency during peak hours.
- Infrastructure Enhancement & Automation: Implemented enterprise software solutions using SAP BODS, reducing data retrieval times by 20% and significantly improving system performance.
- Cloud Platform Familiarity: Applied foundational knowledge of Microsoft Azure (AZ-900) to assist in data movement and cloud-based storage integration for analytics projects.
Portfolio
Explore my portfolio of data-driven projects, where I apply advanced analytics, machine learning, and visualization techniques to extract meaningful insights. From predictive modeling to real-world problem-solving, each project highlights my expertise in SQL, Python, and data science, showcasing my ability to turn raw data into actionable solutions.
- All
- Sports
- Coursework
- GenAI
- Others
Certifications
Microsoft Azure AI Essentials Professional Certificate by Microsoft and LinkedIn, 2025
Microsoft Azure Fundamentals, 2021
ITIL Foundation: ITIL 4 Edition
Introduction to Football Analytics - Hudl Statsbomb, 2025
Power BI, Internshala, 2025
Databricks Academic Accreditation - Generative AI Fundamentals
Contact
Let's connect! Feel free to reach out for collaborations, inquiries, or just to say hello.