About Me

Hello! I'm Ruthvika Reddy Tangirala.

I'm a Data Science and Analytics graduate from Northeastern University (July 2025), driven by a strong foundation in statistical analysis, machine learning, and AI engineering, combined with practical experience across diverse industries including automotive, healthcare, and enterprise IT.

At Mercedes-Benz Research & Development North America, I contributed to projects that enhanced data visibility, compliance, and operational efficiency, building end-to-end data pipelines, designing scalable dashboards using Power BI and SQL Server, and automating the transformation of vehicle test data for business intelligence reporting. I also developed a GenAI-powered pipeline using proprietary language models to extract, translate, and summarize decades of engineering documentation, significantly accelerating access to historical insights and enabling semantic search across technical archives.

Previously at IBM, I worked on chatbot development and dashboard creation for major healthcare clients, using tools like Salesforce Cloud and ServiceNow to support production monitoring and data-driven decision-making.

I bring a versatile skill set in Python, SQL, R, Power BI, and cloud platforms including Azure, AWS, and GCP, with experience spanning ETL automation, dashboard design, statistical modeling, and ML application development. Whether it’s crafting data visualizations, optimizing workflows, or applying AI to solve real-world problems, I aim to deliver solutions that drive measurable impact.

I’m currently seeking full-time opportunities as a Data Scientist, Data Analyst, Machine Learning Engineer, AI Engineer, or Data Engineer, where I can continue building intelligent, scalable, and insightful data solutions.

Education

Northeastern University

Master of Science in Data Science and Analytics; GPA: 3.92/4.0
September 2023 - July 2025 Boston, United States

Relevant Coursework:
  • Database Management
  • Data Mining
  • Data Visualization
  • Machine Learning
  • Machine Learning Operations
  • Data Warehousing

SASTRA Deemed to be University

Bachelor of Technology in Electrical and Electronics Engineering
June 2018 - July 2022 Thanjavur, India

Relevant Coursework:
  • Machine Learning Techniques
  • Artificial Intelligence
  • Data Analytics with Python
  • Python with Web Frameworks

Experience

Mercedes-Benz Research & Development North America

Data Scientist
September 2024 - May 2025 Farmington, MI, USA

  • Engineered a GenAI-powered pipeline using proprietary LLMs to extract, translate (German to English), and summarize 20+ years of meeting notes, enabling semantic search and reducing manual lookup time by 90%
  • Developed a Retrieval-Augmented Generation (RAG) system powered by AI agents and LLMs to perform reasoning over internal data sources, cutting manual analysis effort by over 50%, and enabling contextual, multi-source answers to complex user queries
  • Designed a Power BI dashboard with a Microsoft SQL Server backend to track CARB approval stages and KPIs for OBD groups, with a custom Python GUI for real-time data updates, streamlining compliance visibility across engineering teams
  • Automated a configurable data ingestion and transformation pipeline using a custom Python GUI and backend scripts, automating the flow of multi-source vehicle test data (INCA, GST, Emissions) into Microsoft SQL Server for business intelligence reporting
  • Collaborated with cross-functional and globally distributed teams to improve documentation standards and streamline Agile workflows, ensuring the reliability of automated processes

Northeastern University

Teaching Assistant - Data Science
September 2023 - April 2025, Part-time Boston, MA, USA

  • Guided a class of 700 students through their learning journey through assistance with assignments, labs, projects, office hours and in-class activities. Supported grading, code reviews, and offered guidance to foster student growth and understanding.

IBM

Salesforce Data Scientist
Febraury 2022 - August 2023, Full-time Hyderabad, India

  • Executed complex SQL queries to extract critical object data and data records, facilitating enhanced data-driven decision making and significantly improving system performance
  • Constructed interactive dashboard through data ingestion using ServiceNow data (incidents and RITM records), to monitor production-level bugs and issues, providing weekly and monthly insights for proactive resolution and optimization
  • Spearheaded development of an efficient Chatbot for 'Blue Cross Blue Shield' in Salesforce Cloud, using over 50 tailored instances per question for training and integrating machine learning for quick, accurate customer responses and streamlined case creation
  • Engineered the Broker Portal’s UI of “Blue Cross Blue Shield” using LWC and backend by creating Apex classes in the virtual AWS Environment.

Projects

Personal Trainer AI Agent
Personal Trainer AI Agent

Developed a GenAI fitness assistant using RAG and Pinecone to deliver personalized, goal-driven workout plans with real-time context via a multi-agent system.

EEG Seizure Detection
EEG Seizure Detection

Classified EEG signals data using Random Forest, XGBoost, and RNN models, achieving up to 93% accuracy in seizure detection.

Customer Segmentation using RFM Analysis
Customer Segmentation using RFM Analysis

Segmented customers into four distinct groups using RFM analysis, applying K-Means clustering to develop targeted marketing strategies that significantly enhance customer engagement.

Crime Data Analysis
Crime Data Analysis

Crime data analysis of Los Angeles from 2020 to present, revealing significant insights into crime trends, seasonal patterns, and forecasting future crime rates.

Analysis on Traffic Stops
Analysis on Traffic Stops

Analyzed Montogomery County's traffic violation data to uncover trends and patterns in road safety using Tableau.

US Regional Sales Analysis
US Regional Sales Analysis

Delving into the sales journey from initial order through delivery involving data operations to reveal consumer behaviors and preferences.

Prediction of Census Income
Prediction of Census Income

Built a predictive model to determine individual's income brackets based on socioeconomic attributes.

US Regional Sales Analysis
Hospital Database Management

Developed a hospital management database to optimize healthcare delivery, administrative efficiency, and patient care.

Skills

Languages

vectorlogo.zone vectorlogo.zone vectorlogo.zone upload.wikimedia.org vectorlogo.zone

Databases

vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone

Data Engineering

vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone

Cloud

vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone

Data Visualization

vectorlogo.zone vectorlogo.zone vectorlogo.zone looker

Libraries

pandas NumPy matplotlib seaborn scikit-learn scipy tensorflow pytorch

Tools

vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone

Contact