Passion Fuels Purpose!  

Biography

Hi, I'm Sagar Tate, a Full Stack Data Scientist with a dedication to crafting comprehensive and impactful data-driven solutions. Over the past 5 years, I've immersed myself in the world of data, from developing robust backend systems to creating insightful visualizations that empower data-driven decision-making.

I believe that data science goes beyond just crunching numbers – it's about uncovering meaningful patterns, extracting valuable insights, and ultimately, driving informed decisions. My approach revolves around creating end-to-end solutions that seamlessly integrate data acquisition, processing, modeling, and deployment.

Whether I'm architecting scalable data pipelines, building machine learning models, or designing intuitive data visualizations, I bring a holistic perspective to every project. My commitment to excellence extends from the backend algorithms to the frontend user interfaces, ensuring that the insights derived from data are not only powerful but also accessible and actionable. I am eager to contribute my skills and passion to the success of your next data-driven endeavor.

Sagar Tate
+

satisfied clients

+

projects completed

+

years of experience

Skills

Data Science
Hypothesis Testing
Regression
Classification
Ensemble Learning
Clustering
Sentiment Analysis
NER
LLM
OpenAI
RAG
GenAI
Langchain
Docker
Vector Database
Object Detection
Image Classification
TensorFlow
Nextjs
Tailwind
Typescript
FastAPI
Python
SQL
Git
Unix
GCP
Bigquery
AWS
Sagemaker
Azure Form Recognizer

Experience

  • Senior Data Scientist @Arya.ai

    Dec 2022 – present | Mumbai, India

    Bank Statement Analyser(BSA): •Using computer vision techniques and algorithms such as image processing, pattern recognition, and optical character recognition (OCR), I developed a solution that improved the accuracy of detecting transactions and other details in bank statements by over 95%. (API link) •The implementation of the computer vision solution had a significant impact on the organization, reducing manual labor, improving accuracy, and saving time. •Deployed and managed a RESTful API on a Google Cloud Platform using Docker for 20+ countries, ensuring scalability, security, and optimized performance. •Collaborate closely with clients to understand their unique objectives, translating their requirements into actionable steps for the implementation of our machine learning solution, leading to highly customized and effective deployments. •Bank Statement Extraction(BSE): •Built an API for extracting basic details like account holder name, A/C number, statement duration, etc. •Used OpenAI GPT model to get these basic details from raw OCR output. •Successfully deployed the API on Google Cloud Platform (GCP), ensuring scalability and reliability for seamless integration with the application's backend infrastructure. •Intelligent Document Retrieval and Chatbot System: •Document Processing: •Implemented efficient chunking logic to partition documents into manageable segments for streamlined processing. •Conducted rigorous testing of chunking efficiency, fine-tuning parameters to optimize performance. •Model Deployment and Integration: •Selected and deployed embedding and Large Language Model (LLM) models compatible with AWS Bedrock, ensuring seamless integration with the system architecture. •Configured and deployed ChromaDB on AWS as a client-server, establishing Lambda triggers for real-time document updates in S3 and efficient embedding addition/update operations •Document Access Control and Chat History Management: •Designed and implemented a schema for storing chat history in a relational database (RDS), capturing and storing each conversation with precision. •Chatbot Pipeline Development: •Developed a sophisticated pipeline to process user questions, user IDs/usernames, and chat history seamlessly. •Integrated logic to determine document access based on user ID, delivering tailored responses accordingly. •Monitoring and Validation: •Established CloudWatch metrics for monitoring Lambda calls and outputs, ensuring system reliability and performance. •Conducted extensive validation and testing of the end-to-end pipeline, crafting benchmark questions and meticulously assessing retrieval and response accuracy. •Documented testing outcomes comprehensively, iteratively refining the system for enhanced performance. •API Gateway Setup: •Orchestrated the setup of Lambda functions and API gateway to facilitate seamless API interactions, enabling effortless integration with external systems.

  • Data Scientist @Accenture

    Oct 2021 – Dec 2022 | Pune, India

    Developed a machine learning model using Python and scikit-learn library that predicted the likelihood of insurance claims being fraudulent based on claimant information and historical claims data. •Evaluated the model's performance using metrics such as precision, recall, and F1-score, and presented the results to stakeholders. •The project resulted in a 20% reduction in fraudulent claims, saving the insurance company millions of dollars in losses. •Azure cloud migration- Migrating on-prem (ASIS) to Azure cloud (ToBe)

  • System Engineer @TCS

    Dec 2018 – Oct 2021 | Pune, India

    •Getting alerts insights using pandas and NumPy- Scraping the data using selenium and BS4 to get the raw data from alert reporting tools such as WebGui and analyzing this data to avoid unnecessary failures •Primary Bigdata and ETL support- Monitoring Informatica, Abinitio and Bigdata jobs through TWS, SOA and Informatica •Triggering automatic emails to client for alerting visa batch start and end by reading live SVG graphs/ pictures from website using python •Generating daily reports such as Bigdata (Hound cluster and Omega cluster) job completion using TWS scrapped data with selenium and BS4

Education

  • Data Science and Engineering @Great Lakes

    Apr 2021 – Apr 2022 | Pune, India

    Completed a comprehensive curriculum in Data Science and Engineering, gaining proficiency in statistical analysis, machine learning, and data engineering. Acquired hands-on experience with tools and technologies relevant to the field, contributing to a solid foundation in data-driven decision-making and problem-solving

  • Bachelor Of Engineering In Mechanical @WIT

    Jul 2014 – Jul 2018 | Solapur, India

    Relevant courses and Data Structures and Algorithms.