portfolio

DATA PROJECTS

</td> </tr> </tr> </table>
### PySpark Diabetes Prediction ML Project Overview: Diabetes Prediction ML Project using PySpark. Technologies Used: PySpark. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/42) ![Project Image](/portfolio/images/42.png) ### iTunes Podcast Reviews Dashboards Tableau Overview: Visualization of iTunes podcast reviews using interactive dashboards. Technologies Used: Tableau. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/41) ![Project Image](/portfolio/images/41.png) ### Customer K-means clustering in Python Overview: Clustering customer data to identify distinct groups for targeted marketing. Technologies Used: Python, K-means clustering algorithm. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/30) ![Project Image](/portfolio/images/30.png)
### Machine Learning: Decision Tree with KNIME Overview: Using decision trees for predictive modeling in KNIME. Technologies Used: KNIME. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/31) ![Project Image](/portfolio/images/31_1.png) ![Project Image](/portfolio/images/31_2.png) ### NLP Challenge: IMDB Dataset of 50K Movie Reviews to perform Sentiment Analysis Overview: Analyzing a large dataset of movie reviews to determine sentiment trends using NLP techniques. Technologies Used: Python, Natural Language Processing. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/32) ![Project Image](/portfolio/images/32.png) ### Recommendation System. Collaborative Filtering Overview: Building a collaborative filtering system to recommend products to users based on similar user preferences. Technologies Used: Python, Machine Learning. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/35) ![Project Image](/portfolio/images/35.png) ### Book Recommendation Model. K-Nearest Neighbors Overview: Utilizing the K-Nearest Neighbors algorithm to create a book recommendation system. Technologies Used: Python, K-Nearest Neighbors. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/37) ![Project Image](/portfolio/images/37.png)
### Amazon Customer Reviews Sentiment Analysis Overview: Performing sentiment analysis on Amazon customer reviews to gauge consumer satisfaction. Technologies Used: Python, Natural Language Processing. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/33) ![Project Image](/portfolio/images/33.png) ### Image Classifier using TensorFlow. Keras Overview: Building an image classification model using TensorFlow and Keras. Technologies Used: TensorFlow, Keras. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/36) ![Project Image](/portfolio/images/36.png) ### Linear Regression Health Costs Calculator Overview: Creating a health costs prediction model using linear regression. Technologies Used: Python, Linear Regression. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/38) ![Project Image](/portfolio/images/38.png)
### Neural Network SMS Text Classifier Overview: Developing a text classification system using neural networks to categorize SMS messages. Technologies Used: Python, Neural Networks. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/39) ![Project Image](/portfolio/images/39.png) ### Sentiment Analysis of Yelp Business Reviews Overview: Analyzing Yelp reviews to extract business insights through sentiment analysis. Technologies Used: Python, Natural Language Processing. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/15) ![Project Image](/portfolio/images/15.png) ### Using Streamlit for Data Visualisation Overview: Developing interactive data visualizations using Streamlit to enable dynamic user interactions. Technologies Used: Streamlit, Python. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/18) ![Project Image](/portfolio/images/18_1.png) ![Project Image](/portfolio/images/18_2.png)
### WEB scraping and Sentiment Analysis British Airways Customer Reviews Overview: Extracting and analyzing sentiment from British Airways customer reviews through web scraping. Technologies Used: Python, Web Scraping, Natural Language Processing. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/24) ![Project Image](/portfolio/images/24_1.png) ![Project Image](/portfolio/images/24_2.png) ### Creating Dynamic Filters in Streamlit Overview: Building a Streamlit application that incorporates dynamic filters for data manipulation. Technologies Used: Streamlit, Python. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/19) ![Project Image](/portfolio/images/19_1.png) ![Project Image](/portfolio/images/19_2.png) ### Predicting Customer Behaviour British Airways Overview: Using data analysis and machine learning to predict customer behavior for British Airways. Technologies Used: Python, Machine Learning Algorithms. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/25) ![Project Image](/portfolio/images/25_1.png) ![Project Image](/portfolio/images/25_2.png)
### Kaggle Housing Prices Competition Overview: Participating in the Kaggle competition to predict housing prices based on various features. Technologies Used: Python, Machine Learning, Regression Analysis. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/28) ![Project Image](/portfolio/images/28.png) ### Kaggle Store Sales - Time Series Forecasting Overview: Forecasting store sales using time series analysis in a Kaggle competition. Technologies Used: Python, Time Series Analysis, Machine Learning. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/34) ![Project Image](/portfolio/images/34.png) ### Supervised ML: Regression Tree in Python Overview: Implementing a regression tree to predict outcomes based on a set of input variables. Technologies Used: Python, Decision Trees. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/29) ![Project Image](/portfolio/images/29.png)
### Machine Learning Analysis in Retail Overview: Analyzing retail data using machine learning to optimize inventory and sales strategies. Technologies Used: Python, Machine Learning. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/21) ![Project Image](/portfolio/images/21.png) ### Credit Card Fraud Detection using Scikit-Learn and Snap ML Overview: Developing a model to detect fraudulent transactions using machine learning. Technologies Used: Python, Scikit-Learn, Snap ML. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/22) ![Project Image](/portfolio/images/22_1.png) ![Project Image](/portfolio/images/22_2.png) ### Natural Language Processing with Hugging Face Transformers Overview: Leveraging Hugging Face Transformers for advanced natural language processing tasks. Technologies Used: Python, Hugging Face Transformers. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/23) ![Project Image](/portfolio/images/23_1.png) ![Project Image](/portfolio/images/23_2.png)
### Auto Exploratory Data Analysis with D-Tale, SweetViz, Pandas Profiling Overview: Automating the exploratory data analysis process using various Python libraries. Technologies Used: Python, D-Tale, SweetViz, Pandas Profiling. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/26) ![Project Image](/portfolio/images/26.png) ### Auto ML and Bespoke ML with sklearn (Random Forest, Logistic Regression, SVC) Overview: Implementing both automated and custom machine learning solutions using Scikit-Learn. Technologies Used: Python, Scikit-Learn. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/27) ![Project Image](/portfolio/images/27.png) ### Assess the Quality of a Dataset for a Public Service Agency Overview: Evaluating and improving the quality of a dataset used by a public service agency. Technologies Used: Data Quality Assessment. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/4) ![Project Image](/portfolio/images/4.png)
### Data Transformation Pipeline with Cloud Dataprep (Alteryx) Overview: Designing and implementing a data transformation pipeline using Cloud Dataprep similar to Alteryx. Technologies Used: Cloud Dataprep, Alteryx. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/40) ![Project Image](/portfolio/images/40.png) ### Correlation in Python Overview: Exploring statistical correlations within datasets using Python. Technologies Used: Python, Statistical Analysis. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/20) ![Project Image](/portfolio/images/20.png) ### Explore Data Using SQL in Google Colab Overview: Conducting data exploration and analysis using SQL within the Google Colab environment. Technologies Used: SQL, Google Colab. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/17) ![Project Image](/portfolio/images/17.png)
### SQL Sub-queries in Google Colab Overview: Demonstrating the use of SQL sub-queries for complex data queries in Google Colab. Technologies Used: SQL, Google Colab. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/16) ![Project Image](/portfolio/images/16.png) ### Create a Dashboard Meeting Business Requirements Overview: Developing a customized dashboard to meet specific business analysis needs. Technologies Used: Dashboard Design, Business Analysis. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/6) ![Project Image](/portfolio/images/6.png) ### Retrieve User Activity Data on an Online Forum Using SQL Overview: Extracting and analyzing user activity data from an online forum using SQL. Technologies Used: SQL, Data Analysis. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/7) ![Project Image](/portfolio/images/7.png)
### Working with Web APIs and JSON on Movies Dataset Overview: Utilizing web APIs to fetch and process movie data stored in JSON format. Technologies Used: Web APIs, JSON, Python. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/2) ![Project Image](/portfolio/images/2.png) ### Explore a Dataset on Energy Usage and Draw First Conclusions Overview: Analyzing an energy usage dataset to uncover patterns and draw initial conclusions. Technologies Used: Data Analysis, Visualization Techniques. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/5) ![Project Image](/portfolio/images/5.png) ### Create a Web Server and an Amazon RDS DB Instance Overview: Setting up a web server connected to an Amazon RDS database for handling dynamic web applications. Technologies Used: Web Server Management, Amazon RDS. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/3) ![Project Image](/portfolio/images/3.png)
### Data Analysis using Pandas and SQLite3 Overview: Conducting comprehensive data analysis using Pandas in conjunction with SQLite3 for database management. Technologies Used: Pandas, SQLite3, Python. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/14) ![Project Image](/portfolio/images/14.png) ### E-commerce Store Sales Analysis Overview: Analyzing sales data from an e-commerce platform to optimize marketing and sales strategies. Technologies Used: Data Analysis, Business Intelligence. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/8) ![Project Image](/portfolio/images/8.png) ### Exploratory Data Analysis on Diamonds Dataset Overview: Performing exploratory data analysis on a dataset of diamonds to understand pricing factors. Technologies Used: Data Visualization, Statistical Analysis. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/9) ![Project Image](/portfolio/images/9.png)
### Data Cleaning, Transformation, and Visualisation on AirBnB London Dataset Overview: Cleaning, transforming, and visualizing data from the AirBnB London dataset to derive actionable insights. Technologies Used: Data Cleaning, Data Transformation, Data Visualization. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/12) ![Project Image](/portfolio/images/12.png) ### Data Cleaning on Movies Dataset Overview: Performing data cleaning on a comprehensive movies dataset to prepare for further analysis. Technologies Used: Data Cleaning, Python. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/10) ![Project Image](/portfolio/images/10.png) ### Short-Term Rental Analytics on AirBnB Bristol Dataset Overview: Analyzing short-term rental data from Airbnb in Bristol to understand market trends and rental dynamics. Technologies Used: Data Analysis, Business Intelligence. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/11) ![Project Image](/portfolio/images/11.png)
### Data Cleaning, Merging, Transforming on Movies Dataset Overview: Enhancing a movies dataset by cleaning, merging, and transforming data to support detailed analysis. Technologies Used: Data Cleaning, Data Merging, Data Transformation. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/13) ![Project Image](/portfolio/images/13.png) ### Exploratory Data Analysis on Movies Dataset Overview: Conducting exploratory data analysis on a movies dataset to uncover trends and insights. Technologies Used: Data Analysis, Visualization. [View Project's Files](https://github.com/sitshayeva/portfolio/tree/main/projects/1) ![Project Image](/portfolio/images/1.png) </table>