Skip to content
View Tushar-Siddik's full-sized avatar
Block or Report

Block or report Tushar-Siddik

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Tushar-Siddik/README.md

Data Science Enthusiast πŸ’»

Hi, I'm Md. Siddiqur Rahman, a passionate data science enthusiast with a keen interest in uncovering insights from complex datasets. I enjoy working on end-to-end data science projects, from data collection and cleaning to modeling and visualization. My focus is on creating data-driven solutions that make a meaningful impact.

🧠 Skills & Expertise

Languages & Tools

  • Languages: Python, HTML, CSS
  • Data Analysis: pandas, numpy, SQL, Excel
  • Database: SQLite, MongoDB
  • Version control: Git, GitHUb
  • Data Visualization: matplotlib, seaborn, Plotly
  • Machine Learning: scikit-learn, statsmodels, linearmodels
  • IDE: VS code, Jupyter notebook, Kaggle
  • Data Engineering: ETL
  • More Tools: Word, Powerpoint

Soft Skills

  • Problem-Solving: Strong analytical and critical thinking skills.
  • Collaboration: Experience working in multidisciplinary teams.
  • Communication: Ability to explain complex concepts in simple terms.
  • Project Management: Skilled in Agile methodologies and project coordination.

Work Experience

Machine Learning Internship

  • Company: Mentorness
  • Duration: April, 2024 - May, 2024
  • Credential: Certificate
  • Responsibilities:
    • Authored and presented a technical article on a machine learning topic to internal and external stakeholders.
    • Built and evaluated a machine learning model to predict mobile phone price classes, using data preprocessing, feature engineering, and model assessment techniques.
    • Analyzed the ICC Men's T20 Cricket World Cup dataset, creating visualizations to identify and present key insights.
    • Deployed a machine learning model to a production environment, ensuring functionality and reliability. Collaborated with stakeholders to validate the deployment process.

πŸ“š Projects

1. Mobile Price Classification

  • Built a predictive model to classify mobile phones into predefined price ranges, using attributes such as battery power, camera features, memory, connectivity options, and more.

  • Developed five distinct models: Logistic Regression, Support Vector Classifier, Decision Tree Classifier, Random Forest Classifier, and Gradient Boosting Classifier.

  • Conducted hyperparameter tuning and cross-validation using grid search to optimize model performance.

  • Deployed the best-performing model, which achieved a 98% accuracy rate, using Streamlit for interactive visualization and user interaction.

  • Created an intuitive interface to allow users to interact with the model, providing real-time predictions and insights based on specific mobile phone attributes.

  • Tech Stack: Python, pandas, scikit-leran, matplotlib, streamlit

  • Key Learnings: Data preprocessing, feature engineering, model evaluation, data analysis, data visualization

  • Live: Mobile Price Classification

  • GitHub Repo: Link to repo

  • Jupyter Notebook: Link to Kaggle

2. ICC Men's T20 Cricket 2022 Data Analysis

  • Analyzed a comprehensive dataset from a major cricket tournament, focusing on batting and bowling statistics.

  • Identified and explained key batting metrics, including most runs scored, highest strike rates, and best batting averages, across different teams, innings, and individual players.

  • Evaluated key bowling metrics, such as most wickets taken, lowest economy rates, and lowest average runs conceded, contextualizing the data by innings, teams, and individual bowlers.

  • Conducted in-depth analysis of additional features, including most boundaries hit, to provide comprehensive insights into the tournament's performance trends.

  • Tech Stack: Python, pandas, plotly, matplotlib, dash, gunicorn

  • Key Learnings: Data preprocessing, Data Analysis, Data Visualization

  • Live: ICC Men's T20 Cricket 2022 Data Analysis

  • GitHub Repo: Link to repo

  • Jupyter Notebook: Link to Kaggle

πŸŽ“ Education

  • Bachelor of Science in Economics

    • University: Jahangirnagar University
    • Graduation Year: 2021
    • Relevant Coursework: Econometrics, Data Visualization, Statistics, Linear Algebra, Calculus
  • Certification in Data Science

    • Organization: World Quant University
    • Completion Year: 2023
    • Description: A comprehensive data science certification covering various data science and machine learning topics. The tools to interact with databses (i.e., mongodb, sqlite) are also important features.
    • Credential: Credly Badge
  • Certification in Python

    • Organization: Harvard University
    • Completion Year: 2023
    • Description: A Python course with David Malan introduces foundational topics like functions, variables, conditionals, and loops. It includes practical training in reading, writing, testing, and debugging code, with a focus on exception handling and unit testing. The course also covers regular expressions for data manipulation, object-oriented programming to model real-world entities, and file operations for reading and writing files. Third-party libraries are explored to extend Python's capabilities, equipping students with the skills to address real-world coding challenges.
    • Credential: Certificate
  • Certification in Python (Crash Course)

    • Organization: Google (Coursera)
    • Completion Year: 2021
    • Description: A Python programming course designed for beginners covered foundational concepts, emphasizing the benefits of Python in IT. It included basic syntax and hands-on practice with different code editors, allowing exploration into writing computer programs. The course demonstrated that with the right code, computers can accomplish a lot.
    • Credential: Certificate

πŸ“« Contact

I'm open to collaboration and enjoy discussing all things data science. Don't hesitate to reach out!

Popular repositories

  1. kitkat_clock01 kitkat_clock01 Public

    CSS responsive clock

    HTML 1

  2. bookshelf-management bookshelf-management Public

    Forked from EbookFoundation/bookshelf-management

    Application for managing bookshelves on project gutenberg site.

    Python 1

  3. design-resources-for-developers design-resources-for-developers Public

    Forked from bradtraversy/design-resources-for-developers

    Curated list of design and UI resources from stock photos, web templates, CSS frameworks, UI libraries, tools and much more

    1

  4. regression_discontinuity_design regression_discontinuity_design Public

    Jupyter Notebook 1

  5. Agriculture-Survey-excel-file-template Agriculture-Survey-excel-file-template Public

    Survey template (excel) about the costing of rice, potato and onion under the Agricultural economics course (Econ-201) of Jahangirnagar University

  6. docs-index.html docs-index.html Public