Siddharth Prasad

Data Scientist & ML Enthusiast

LinkedIn

[email protected]

8076603030

Delhi, IN

About

Highly motivated Data Scientist and ML Enthusiast with hands-on experience in developing and deploying robust machine learning models for critical applications like fraud detection, churn prediction, and NLP chatbots. Proficient in Python, SQL, ML, and data visualization tools (Power BI, Tableau), adept at transforming complex datasets into actionable insights to drive smart, data-driven business decisions and achieve measurable impact.

Work Experience

NLP Developer (Project work)

Robustrix AI

Jan 2023 - Dec 2023

Delhi, India, IN

Developed and integrated NLP solutions for a news aggregation platform, enhancing content processing and real-time data insights.

  • Engineered a news aggregation and sentiment analysis pipeline, processing approximately 5,000 articles daily using APIs and NLP.
  • Implemented advanced topic tagging and de-duplication algorithms, resulting in a 35% reduction in duplicate content.
  • Integrated enhanced NLP features into an AI dashboard, providing real-time updates for improved data analysis and decision-making.

Freelance Project Developer

Academic Projects (B.Tech Students - Freelance)

Jan 2022 - Dec 2023

Delhi, India, IN

Delivered diverse academic projects for B.Tech students, ensuring timely and successful deployment of custom solutions.

  • Developed and delivered over 5 academic projects, including chatbots, voice assistants, games, and websites, for B.Tech students.
  • Achieved 100% on-time project delivery, ensuring successful deployment for student submissions and demonstrations.

Education

Arts

Delhi University

Sep 2021 - May 2025

Delhi, India, IN

Certificates

Data Science

National Institute of Electronics and Information Technology

Python for Data Analysis

Great Learning

Business Professional Programmer (O level)

National Institute of Electronics Information Technology

CCC

National Institute of Electronics and Information Technology

Projects

Credit Card Fraud Detection

Apr 2024 - Jun 2024

Built an end-to-end pipeline for credit card fraud detection, leveraging Random Forest and SMOTE on a large transaction dataset.

Email Spam Classification

Jan 2024 - Mar 2024

Developed an email spam classification system using TF-IDF, Naïve Bayes, and Logistic Regression to efficiently filter emails.

Real Estate Price Prediction

Oct 2023 - Dec 2023

Developed a content-based recommender system for real estate price prediction, delivering personalized property suggestions.

Movie Recommendation System

Jul 2023 - Sep 2023

Engineered and deployed a content-based movie recommender system using TF-IDF and cosine similarity on a dataset of 5,000 movies.

Crop Recommendation System

Apr 2023 - Jun 2023

Designed a recommendation system to suggest optimal crops based on environmental parameters, enhancing agricultural decision-making.

AI Doctor Chatbot

Jan 2023 - Mar 2023

Developed an NLP-based chatbot to provide medical Q&A, leveraging Python and Regex for robust functionality.

Skills

Programming Languages

  • Python
  • SQL

Machine Learning

  • Supervised Learning
  • Unsupervised Learning
  • Natural Language Processing (NLP)
  • Fraud Detection
  • Churn Prediction
  • Recommendation Systems
  • Classification
  • Regression
  • Decision Trees
  • Random Forest
  • TF-IDF
  • Cosine Similarity
  • Naïve Bayes
  • Logistic Regression
  • SMOTE

Frameworks & Libraries

  • LangChain
  • Hugging Face
  • OpenAI API
  • Flask
  • Streamlit
  • Regex

Data Visualization

  • Tableau
  • Power BI

Databases

  • PostgreSQL
  • MongoDB

Cloud Platforms

  • AWS (Basic)