I'm SRINIDHI
Data Scientist
(AI/ML Research • Engineering • Analytics)
Building advanced AI/ML solutions and scalable data pipelines. Specialized in LLMs, NLP, and cloud-based machine learning systems.

About
A passionate and results-driven Data Scientist and Analyst with a strong background in machine learning, data engineering and BI reporting. I specialize in transforming complex datasets into actionable insights that drive business growth and innovation. Whether it's building scalable ETL pipelines, optimizing ML models or designing interactive dashboards, I thrive at the intersection of data and decision-making.
Skills
My Skills
Programming Languages & Frameworks
Python, SQL (MySQL, PostgreSQL), R, C, C++, HTML, CSS, Scikit-Learn, TensorFlow, PyTorch, SAS
AI / Machine Learning
Python • Scikit-Learn • TensorFlow • PyTorch • SAS • Machine Learning (Supervised & Unsupervised) • Predictive Analytics • Deep Learning (CNN, RNN, LSTM, NLP) • LLMs • RAG • AI Agents • Crew AI • LangGraph • Model Evaluation • Model Tuning
Data Engineering
Data Engineering (ETL, Airflow, AWS Glue, BigQuery) • ETL Pipelines • Automated Data Workflows • Data Quality & Reliability • AWS (EC2, Athena, Glue, Lambda, S3, EFS, RDS) • GCP (BigQuery, VPC, Dataflow) • Databricks (MLflow, Delta Lake) • Azure • Docker • Git • Apache Airflow • Kafka • Snowflake
Data Analytics & BI
Data Cleaning • Profiling • Validation • EDA • Statistical Modeling • Regression Analysis • Hypothesis Testing • Correlation Analysis • A/B Testing • Forecasting • Anomaly Detection • Root Cause Analysis • KPI Monitoring • Drill-Down Analysis • Trend Analysis • Data Storytelling • Dashboard Optimization • Ad-Hoc Reporting • Interactive Filters • Excel • Python • Tableau • Power BI • Looker Studio
Developer Tools
Excel, Power BI (DAX, M Code), Tableau, Docker, Git, Apache Airflow, Kafka, Looker Studio, Snowflake, Jupyter, Streamlit
Collaboration
Cross-functional Collaboration, Data Storytelling, Stakeholder Communication, Agile Methodologies, Task Management
Certifications
Model Validation 1 - Trainer
Handshake
Model Validation 2 - Expert
Handshake
Generative AI Fundamentals
Databricks
AI Security Fundamentals
Databricks
SQL Associate
DataCamp
Experience
AI Research Analyst
Handshake AI (Freelancing) 10/2025 – Present
Evaluated agent ideation traces and code for scientific integrity issues, improving trust and safety in reinforcement learning systems.
Refined autonomous ML agents by simulating ideation traces, flagging rogue behaviors, and proposing corrected plan-code pairs.
Data Science Analyst
Madhi.Ai (San Francisco, USA) 06/2025 – Present
Designed & deployed AI agent workflows using SLMs, Crew AI and Advanced RAG to deliver context-aware, high-accuracy results.
Built ETL pipelines to process PDFs, emails, CRM and other unstructured data, improving model performance.
Created dashboards in Looker Studio to track KPIs, cost efficiency, and adoption metrics.
Collaborated with engineers and product teams to fine-tune AI agents, reducing latency and increasing reliability in production.
Data Analyst
Armorblox (Cisco) 08/2022 – 07/2023
Developed an automated data pipeline using Snorkel & keyword-based functions, improving labeling efficiency and raising model accuracy to 92%.
Built and presented interactive dashboards using Power BI and Python to visualize attack types, frequency and trends, enabling data-driven decision-making and product enhancement prioritization.
Analyzed large-scale email data to detect fraud and sensitive information using SQL, Python and statistical methods, improving threat detection and identifying zero-day phishing attacks.
Researched and validated multiple machine learning models to improve detection accuracy, reducing false positives by 12% through automated retraining, dataset updates, and model tuning.
Collaborated with cross-functional teams (threat research, product and customer success) to operationalize insights, mitigate security threats and reduce incident response time.
AWS Cloud Data Engineer
Cognizant 02/2022 – 08/2022
Built an ETL system to ingest and transform multi-source data, ensuring quality, consistency and reliability across the pipeline.
Developed and optimized scalable data workflows using AWS services including Glue, Athena, EC2, Redshift, Lambda, S3 and SageMaker, reducing processing latency and enhancing overall system performance.
Created interactive dashboards with Amazon QuickSight to visualize key metrics and deliver actionable insights.





