portfolio

DATA PROJECTS

### Market Basket Analysis Overview: Market Basket Analysis project based on "The Bread Basket" dataset to identify purchasing patterns and generate actionable insights using association rules. **Technologies Used**: Python, Pandas, Seaborn, mlxtend, Pyvis. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/43) ![Project Image](/portfolio/images/43.png) ### PySpark Diabetes Prediction ML Project Overview: Diabetes Prediction ML Project using PySpark. **Technologies Used**: PySpark. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/42) ![Project Image](/portfolio/images/42.png) ### iTunes Podcast Reviews Dashboards Tableau Overview: Visualization of iTunes podcast reviews using interactive dashboards. **Technologies Used**: Tableau. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/41) ![Project Image](/portfolio/images/41.png)
### Customer K-means clustering in Python Overview: Clustering customer data to identify distinct groups for targeted marketing. **Technologies Used**: Python, K-means clustering algorithm. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/30) ![Project Image](/portfolio/images/30.png) ### Machine Learning: Decision Tree with KNIME Overview: Using decision trees for predictive modeling in KNIME. **Technologies Used**: KNIME. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/31) ![Project Image](/portfolio/images/31_1.png) ![Project Image](/portfolio/images/31_2.png) ### NLP Challenge: IMDB Dataset of 50K Movie Reviews to perform Sentiment Analysis Overview: Analyzing a large dataset of movie reviews to determine sentiment trends using NLP techniques. **Technologies Used**: Python, Natural Language Processing. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/32) ![Project Image](/portfolio/images/32.png)
### Recommendation System. Collaborative Filtering Overview: Building a collaborative filtering system to recommend products to users based on similar user preferences. **Technologies Used**: Python, Machine Learning. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/35) ![Project Image](/portfolio/images/35.png) ### Book Recommendation Model. K-Nearest Neighbors Overview: Utilizing the K-Nearest Neighbors algorithm to create a book recommendation system. **Technologies Used**: Python, K-Nearest Neighbors. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/37) ![Project Image](/portfolio/images/37.png) ### Amazon Customer Reviews Sentiment Analysis Overview: Performing sentiment analysis on Amazon customer reviews to gauge consumer satisfaction. **Technologies Used**: Python, Natural Language Processing. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/33) ![Project Image](/portfolio/images/33.png)
### Image Classifier using TensorFlow. Keras Overview: Building an image classification model using TensorFlow and Keras. **Technologies Used**: TensorFlow, Keras. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/36) ![Project Image](/portfolio/images/36.png) ### Linear Regression Health Costs Calculator Overview: Creating a health costs prediction model using linear regression. **Technologies Used**: Python, Linear Regression. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/38) ![Project Image](/portfolio/images/38.png) ### Neural Network SMS Text Classifier Overview: Developing a text classification system using neural networks to categorize SMS messages. **Technologies Used**: Python, Neural Networks. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/39) ![Project Image](/portfolio/images/39.png)
### Sentiment Analysis of Yelp Business Reviews Overview: Analyzing Yelp reviews to extract business insights through sentiment analysis. **Technologies Used**: Python, Natural Language Processing. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/15) ![Project Image](/portfolio/images/15.png) ### Using Streamlit for Data Visualisation Overview: Developing interactive data visualizations using Streamlit to enable dynamic user interactions. **Technologies Used**: Streamlit, Python. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/18) ![Project Image](/portfolio/images/18_1.png) ![Project Image](/portfolio/images/18_2.png) ### WEB scraping and Sentiment Analysis British Airways Customer Reviews Overview: Extracting and analyzing sentiment from British Airways customer reviews through web scraping. **Technologies Used**: Python, Web Scraping, Natural Language Processing. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/24) ![Project Image](/portfolio/images/24_1.png) ![Project Image](/portfolio/images/24_2.png)
### Creating Dynamic Filters in Streamlit Overview: Building a Streamlit application that incorporates dynamic filters for data manipulation. **Technologies Used**: Streamlit, Python. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/19) ![Project Image](/portfolio/images/19_1.png) ![Project Image](/portfolio/images/19_2.png) ### Predicting Customer Behaviour British Airways Overview: Using data analysis and machine learning to predict customer behavior for British Airways. **Technologies Used**: Python, Machine Learning Algorithms. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/25) ![Project Image](/portfolio/images/25_1.png) ![Project Image](/portfolio/images/25_2.png) ### Kaggle Housing Prices Competition Overview: Participating in the Kaggle competition to predict housing prices based on various features. **Technologies Used**: Python, Machine Learning, Regression Analysis. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/28) ![Project Image](/portfolio/images/28.png)
### Kaggle Store Sales - Time Series Forecasting Overview: Forecasting store sales using time series analysis in a Kaggle competition. **Technologies Used**: Python, Time Series Analysis, Machine Learning. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/34) ![Project Image](/portfolio/images/34.png) ### Supervised ML: Regression Tree in Python Overview: Implementing a regression tree to predict outcomes based on a set of input variables. **Technologies Used**: Python, Decision Trees. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/29) ![Project Image](/portfolio/images/29.png) ### Machine Learning Analysis in Retail Overview: Analyzing retail data using machine learning to optimize inventory and sales strategies. **Technologies Used**: Python, Machine Learning. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/21) ![Project Image](/portfolio/images/21.png)
### Credit Card Fraud Detection using Scikit-Learn and Snap ML Overview: Developing a model to detect fraudulent transactions using machine learning. **Technologies Used**: Python, Scikit-Learn, Snap ML. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/22) ![Project Image](/portfolio/images/22_1.png) ![Project Image](/portfolio/images/22_2.png) ### Natural Language Processing with Hugging Face Transformers Overview: Leveraging Hugging Face Transformers for advanced natural language processing tasks. **Technologies Used**: Python, Hugging Face Transformers. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/23) ![Project Image](/portfolio/images/23_1.png) ![Project Image](/portfolio/images/23_2.png) ### Auto Exploratory Data Analysis with D-Tale, SweetViz, Pandas Profiling Overview: Automating the exploratory data analysis process using various Python libraries. **Technologies Used**: Python, D-Tale, SweetViz, Pandas Profiling. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/26) ![Project Image](/portfolio/images/26.png)
### Auto ML and Bespoke ML with sklearn (Random Forest, Logistic Regression, SVC) Overview: Implementing both automated and custom machine learning solutions using Scikit-Learn. **Technologies Used**: Python, Scikit-Learn. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/27) ![Project Image](/portfolio/images/27.png) ### Assess the Quality of a Dataset for a Public Service Agency Overview: Evaluating and improving the quality of a dataset used by a public service agency. **Technologies Used**: Data Quality Assessment. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/4) ![Project Image](/portfolio/images/4.png) ### Data Transformation Pipeline with Cloud Dataprep (Alteryx) Overview: Designing and implementing a data transformation pipeline using Cloud Dataprep similar to Alteryx. **Technologies Used**: Cloud Dataprep, Alteryx. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/40) ![Project Image](/portfolio/images/40.png)
### Correlation in Python Overview: Exploring statistical correlations within datasets using Python. **Technologies Used**: Python, Statistical Analysis. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/20) ![Project Image](/portfolio/images/20.png) ### Explore Data Using SQL in Google Colab Overview: Conducting data exploration and analysis using SQL within the Google Colab environment. **Technologies Used**: SQL, Google Colab. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/17) ![Project Image](/portfolio/images/17.png) ### SQL Sub-queries in Google Colab Overview: Demonstrating the use of SQL sub-queries for complex data queries in Google Colab. **Technologies Used**: SQL, Google Colab. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/16) ![Project Image](/portfolio/images/16.png)
### Create a Dashboard Meeting Business Requirements Overview: Developing a customized dashboard to meet specific business analysis needs. **Technologies Used**: Dashboard Design, Business Analysis. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/6) ![Project Image](/portfolio/images/6.png) ### Retrieve User Activity Data on an Online Forum Using SQL Overview: Extracting and analyzing user activity data from an online forum using SQL. **Technologies Used**: SQL, Data Analysis. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/7) ![Project Image](/portfolio/images/7.png) ### Working with Web APIs and JSON on Movies Dataset Overview: Utilizing web APIs to fetch and process movie data stored in JSON format. **Technologies Used**: Web APIs, JSON, Python. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/2) ![Project Image](/portfolio/images/2.png)
### Explore a Dataset on Energy Usage and Draw First Conclusions Overview: Analyzing an energy usage dataset to uncover patterns and draw initial conclusions. **Technologies Used**: Data Analysis, Visualization Techniques. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/5) ![Project Image](/portfolio/images/5.png) ### Create a Web Server and an Amazon RDS DB Instance Overview: Setting up a web server connected to an Amazon RDS database for handling dynamic web applications. **Technologies Used**: Web Server Management, Amazon RDS. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/3) ![Project Image](/portfolio/images/3.png) ### Data Analysis using Pandas and SQLite3 Overview: Conducting comprehensive data analysis using Pandas in conjunction with SQLite3 for database management. **Technologies Used**: Pandas, SQLite3, Python. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/14) ![Project Image](/portfolio/images/14.png)
### E-commerce Store Sales Analysis Overview: Analyzing sales data from an e-commerce platform to optimize marketing and sales strategies. **Technologies Used**: Data Analysis, Business Intelligence. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/8) ![Project Image](/portfolio/images/8.png) ### Exploratory Data Analysis on Diamonds Dataset Overview: Performing exploratory data analysis on a dataset of diamonds to understand pricing factors. **Technologies Used**: Data Visualization, Statistical Analysis. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/9) ![Project Image](/portfolio/images/9.png) ### Data Cleaning, Transformation, and Visualisation on AirBnB London Dataset Overview: Cleaning, transforming, and visualizing data from the AirBnB London dataset to derive actionable insights. **Technologies Used**: Data Cleaning, Data Transformation, Data Visualization. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/12) ![Project Image](/portfolio/images/12.png)
### Data Cleaning on Movies Dataset Overview: Performing data cleaning on a comprehensive movies dataset to prepare for further analysis. **Technologies Used**: Data Cleaning, Python. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/10) ![Project Image](/portfolio/images/10.png) ### Short-Term Rental Analytics on AirBnB Bristol Dataset Overview: Analyzing short-term rental data from Airbnb in Bristol to understand market trends and rental dynamics. **Technologies Used**: Data Analysis, Business Intelligence. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/11) ![Project Image](/portfolio/images/11.png) ### Data Cleaning, Merging, Transforming on Movies Dataset Overview: Enhancing a movies dataset by cleaning, merging, and transforming data to support detailed analysis. **Technologies Used**: Data Cleaning, Data Merging, Data Transformation. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/13) ![Project Image](/portfolio/images/13.png)
### Exploratory Data Analysis on Movies Dataset Overview: Conducting exploratory data analysis on a movies dataset to uncover trends and insights. **Technologies Used**: Data Analysis, Visualization. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/1) ![Project Image](/portfolio/images/1.png)