### PySpark Diabetes Prediction ML Project
Overview: Predicting diabetes with a machine learning model built in PySpark.
Technologies Used: PySpark.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/42)
![Project Image](/portfolio/images/42.png)
### iTunes Podcast Reviews Dashboards in Tableau
Overview: Visualization of iTunes podcast reviews using interactive dashboards.
Technologies Used: Tableau.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/41)
![Project Image](/portfolio/images/41.png)
### Customer K-means Clustering in Python
Overview: Clustering customer data to identify distinct groups for targeted marketing.
Technologies Used: Python, K-means clustering algorithm.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/30)
![Project Image](/portfolio/images/30.png)
</tr>
### Machine Learning: Decision Tree with KNIME
Overview: Using decision trees for predictive modeling in KNIME.
Technologies Used: KNIME.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/31)
![Project Image](/portfolio/images/31_1.png) ![Project Image](/portfolio/images/31_2.png)
### NLP Challenge: Sentiment Analysis on the IMDB Dataset of 50K Movie Reviews
Overview: Analyzing a large dataset of movie reviews to determine sentiment trends using NLP techniques.
Technologies Used: Python, Natural Language Processing.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/32)
![Project Image](/portfolio/images/32.png)
### Recommendation System: Collaborative Filtering
Overview: Building a collaborative filtering system to recommend products to users based on similar user preferences.
Technologies Used: Python, Machine Learning.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/35)
![Project Image](/portfolio/images/35.png)
### Book Recommendation Model: K-Nearest Neighbors
Overview: Utilizing the K-Nearest Neighbors algorithm to create a book recommendation system.
Technologies Used: Python, K-Nearest Neighbors.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/37)
![Project Image](/portfolio/images/37.png)
</table>
### Amazon Customer Reviews Sentiment Analysis
Overview: Performing sentiment analysis on Amazon customer reviews to gauge consumer satisfaction.
Technologies Used: Python, Natural Language Processing.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/33)
![Project Image](/portfolio/images/33.png)
### Image Classifier using TensorFlow and Keras
Overview: Building an image classification model using TensorFlow and Keras.
Technologies Used: TensorFlow, Keras.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/36)
![Project Image](/portfolio/images/36.png)
### Linear Regression Health Costs Calculator
Overview: Creating a health costs prediction model using linear regression.
Technologies Used: Python, Linear Regression.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/38)
![Project Image](/portfolio/images/38.png)
### Neural Network SMS Text Classifier
Overview: Developing a text classification system using neural networks to categorize SMS messages.
Technologies Used: Python, Neural Networks.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/39)
![Project Image](/portfolio/images/39.png)
### Sentiment Analysis of Yelp Business Reviews
Overview: Analyzing Yelp reviews to extract business insights through sentiment analysis.
Technologies Used: Python, Natural Language Processing.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/15)
![Project Image](/portfolio/images/15.png)
### Using Streamlit for Data Visualisation
Overview: Developing interactive data visualizations using Streamlit to enable dynamic user interactions.
Technologies Used: Streamlit, Python.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/18)
![Project Image](/portfolio/images/18_1.png) ![Project Image](/portfolio/images/18_2.png)
### Web Scraping and Sentiment Analysis of British Airways Customer Reviews
Overview: Extracting and analyzing sentiment from British Airways customer reviews through web scraping.
Technologies Used: Python, Web Scraping, Natural Language Processing.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/24)
![Project Image](/portfolio/images/24_1.png) ![Project Image](/portfolio/images/24_2.png)
### Creating Dynamic Filters in Streamlit
Overview: Building a Streamlit application that incorporates dynamic filters for data manipulation.
Technologies Used: Streamlit, Python.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/19)
![Project Image](/portfolio/images/19_1.png) ![Project Image](/portfolio/images/19_2.png)
### Predicting Customer Behaviour for British Airways
Overview: Using data analysis and machine learning to predict customer behavior for British Airways.
Technologies Used: Python, Machine Learning Algorithms.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/25)
![Project Image](/portfolio/images/25_1.png) ![Project Image](/portfolio/images/25_2.png)
### Kaggle Housing Prices Competition
Overview: Participating in the Kaggle competition to predict housing prices based on various features.
Technologies Used: Python, Machine Learning, Regression Analysis.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/28)
![Project Image](/portfolio/images/28.png)
### Kaggle Store Sales - Time Series Forecasting
Overview: Forecasting store sales using time series analysis in a Kaggle competition.
Technologies Used: Python, Time Series Analysis, Machine Learning.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/34)
![Project Image](/portfolio/images/34.png)
### Supervised ML: Regression Tree in Python
Overview: Implementing a regression tree to predict outcomes based on a set of input variables.
Technologies Used: Python, Decision Trees.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/29)
![Project Image](/portfolio/images/29.png)
### Machine Learning Analysis in Retail
Overview: Analyzing retail data using machine learning to optimize inventory and sales strategies.
Technologies Used: Python, Machine Learning.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/21)
![Project Image](/portfolio/images/21.png)
### Credit Card Fraud Detection using Scikit-Learn and Snap ML
Overview: Developing a model to detect fraudulent transactions using machine learning.
Technologies Used: Python, Scikit-Learn, Snap ML.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/22)
![Project Image](/portfolio/images/22_1.png) ![Project Image](/portfolio/images/22_2.png)
### Natural Language Processing with Hugging Face Transformers
Overview: Leveraging Hugging Face Transformers for advanced natural language processing tasks.
Technologies Used: Python, Hugging Face Transformers.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/23)
![Project Image](/portfolio/images/23_1.png) ![Project Image](/portfolio/images/23_2.png)
### Auto Exploratory Data Analysis with D-Tale, SweetViz, Pandas Profiling
Overview: Automating the exploratory data analysis process using various Python libraries.
Technologies Used: Python, D-Tale, SweetViz, Pandas Profiling.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/26)
![Project Image](/portfolio/images/26.png)
### Auto ML and Bespoke ML with sklearn (Random Forest, Logistic Regression, SVC)
Overview: Implementing both automated and custom machine learning solutions using Scikit-Learn.
Technologies Used: Python, Scikit-Learn.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/27)
![Project Image](/portfolio/images/27.png)
### Assess the Quality of a Dataset for a Public Service Agency
Overview: Evaluating and improving the quality of a dataset used by a public service agency.
Technologies Used: Data Quality Assessment.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/4)
![Project Image](/portfolio/images/4.png)
### Data Transformation Pipeline with Cloud Dataprep (Alteryx)
Overview: Designing and implementing a data transformation pipeline using Cloud Dataprep by Alteryx.
Technologies Used: Cloud Dataprep, Alteryx.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/40)
![Project Image](/portfolio/images/40.png)
### Correlation in Python
Overview: Exploring statistical correlations within datasets using Python.
Technologies Used: Python, Statistical Analysis.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/20)
![Project Image](/portfolio/images/20.png)
### Explore Data Using SQL in Google Colab
Overview: Conducting data exploration and analysis using SQL within the Google Colab environment.
Technologies Used: SQL, Google Colab.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/17)
![Project Image](/portfolio/images/17.png)
### SQL Sub-queries in Google Colab
Overview: Demonstrating the use of SQL sub-queries for complex data queries in Google Colab.
Technologies Used: SQL, Google Colab.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/16)
![Project Image](/portfolio/images/16.png)
### Create a Dashboard Meeting Business Requirements
Overview: Developing a customized dashboard to meet specific business analysis needs.
Technologies Used: Dashboard Design, Business Analysis.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/6)
![Project Image](/portfolio/images/6.png)
### Retrieve User Activity Data on an Online Forum Using SQL
Overview: Extracting and analyzing user activity data from an online forum using SQL.
Technologies Used: SQL, Data Analysis.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/7)
![Project Image](/portfolio/images/7.png)
### Working with Web APIs and JSON on Movies Dataset
Overview: Utilizing web APIs to fetch and process movie data stored in JSON format.
Technologies Used: Web APIs, JSON, Python.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/2)
![Project Image](/portfolio/images/2.png)
### Explore a Dataset on Energy Usage and Draw First Conclusions
Overview: Analyzing an energy usage dataset to uncover patterns and draw initial conclusions.
Technologies Used: Data Analysis, Visualization Techniques.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/5)
![Project Image](/portfolio/images/5.png)
### Create a Web Server and an Amazon RDS DB Instance
Overview: Setting up a web server connected to an Amazon RDS database for handling dynamic web applications.
Technologies Used: Web Server Management, Amazon RDS.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/3)
![Project Image](/portfolio/images/3.png)
### Data Analysis using Pandas and SQLite3
Overview: Conducting comprehensive data analysis using Pandas in conjunction with SQLite3 for database management.
Technologies Used: Pandas, SQLite3, Python.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/14)
![Project Image](/portfolio/images/14.png)
### E-commerce Store Sales Analysis
Overview: Analyzing sales data from an e-commerce platform to optimize marketing and sales strategies.
Technologies Used: Data Analysis, Business Intelligence.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/8)
![Project Image](/portfolio/images/8.png)
### Exploratory Data Analysis on Diamonds Dataset
Overview: Performing exploratory data analysis on a dataset of diamonds to understand pricing factors.
Technologies Used: Data Visualization, Statistical Analysis.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/9)
![Project Image](/portfolio/images/9.png)
### Data Cleaning, Transformation, and Visualisation on Airbnb London Dataset
Overview: Cleaning, transforming, and visualizing data from the Airbnb London dataset to derive actionable insights.
Technologies Used: Data Cleaning, Data Transformation, Data Visualization.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/12)
![Project Image](/portfolio/images/12.png)
### Data Cleaning on Movies Dataset
Overview: Performing data cleaning on a comprehensive movies dataset to prepare for further analysis.
Technologies Used: Data Cleaning, Python.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/10)
![Project Image](/portfolio/images/10.png)
### Short-Term Rental Analytics on Airbnb Bristol Dataset
Overview: Analyzing short-term rental data from Airbnb in Bristol to understand market trends and rental dynamics.
Technologies Used: Data Analysis, Business Intelligence.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/11)
![Project Image](/portfolio/images/11.png)
### Data Cleaning, Merging, and Transforming on Movies Dataset
Overview: Enhancing a movies dataset by cleaning, merging, and transforming data to support detailed analysis.
Technologies Used: Data Cleaning, Data Merging, Data Transformation.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/13)
![Project Image](/portfolio/images/13.png)
### Exploratory Data Analysis on Movies Dataset
Overview: Conducting exploratory data analysis on a movies dataset to uncover trends and insights.
Technologies Used: Data Analysis, Visualization.
[View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/1)
![Project Image](/portfolio/images/1.png)
</table>