Data Science Portfolio - Kaggle Datasets & AI Projects | Artificial Intelligence
37.9K subscribers
284 photos
76 files
337 links
Free Datasets For Data Science Projects & Portfolio

Buy ads: https://telega.io/c/DataPortfolio

For Promotions/ads: @coderfun @love_data
Download Telegram
๐—๐—ฃ ๐— ๐—ผ๐—ฟ๐—ด๐—ฎ๐—ป ๐—™๐—ฅ๐—˜๐—˜ ๐—ฉ๐—ถ๐—ฟ๐˜๐˜‚๐—ฎ๐—น ๐—–๐—ฒ๐—ฟ๐˜๐—ถ๐—ณ๐—ถ๐—ฐ๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐—ฃ๐—ฟ๐—ผ๐—ด๐—ฟ๐—ฎ๐—บ๐Ÿ˜

Want hands-on experience from a top global company without leaving your home?

These FREE virtual internship by JPMorgan on Forage let you explore careers in

โœ… Software Engineering
โœ… Investment Banking
โœ… Quantitative Research

๐‹๐ข๐ง๐ค ๐Ÿ‘‡:-

https://pdlink.in/4kStNZi

Enroll For FREE & Get Certified ๐ŸŽ“
Learn Data Science in 2024

๐Ÿญ. ๐—”๐—ฝ๐—ฝ๐—น๐˜† ๐—ฃ๐—ฎ๐—ฟ๐—ฒ๐˜๐—ผ'๐˜€ ๐—Ÿ๐—ฎ๐˜„ ๐˜๐—ผ ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป ๐—๐˜‚๐˜€๐˜ ๐—˜๐—ป๐—ผ๐˜‚๐—ด๐—ต ๐Ÿ“š

Pareto's Law states that "that 80% of consequences come from 20% of the causes".

This law should serve as a guiding framework for the volume of content you need to know to be proficient in data science.

Often rookies make the mistake of overspending their time learning algorithms that are rarely applied in production. Learning about advanced algorithms such as XLNet, Bayesian SVD++, and BiLSTMs, are cool to learn.

But, in reality, you will rarely apply such algorithms in production (unless your job demands research and application of state-of-the-art algos).

For most ML applications in production - especially in the MVP phase, simple algos like logistic regression, K-Means, random forest, and XGBoost provide the biggest bang for the buck because of their simplicity in training, interpretation and productionization.

So, invest more time learning topics that provide immediate value now, not a year later.

๐Ÿฎ. ๐—™๐—ถ๐—ป๐—ฑ ๐—ฎ ๐— ๐—ฒ๐—ป๐˜๐—ผ๐—ฟ โšก

Thereโ€™s a Japanese proverb that says โ€œBetter than a thousand days of diligent study is one day with a great teacher.โ€ This proverb directly applies to learning data science quickly.

Mentors can teach you about how to build a model in production and how to manage stakeholders - stuff that you donโ€™t often read about in courses and books.

So, find a mentor who can teach you practical knowledge in data science.

๐Ÿฏ. ๐——๐—ฒ๐—น๐—ถ๐—ฏ๐—ฒ๐—ฟ๐—ฎ๐˜๐—ฒ ๐—ฃ๐—ฟ๐—ฎ๐—ฐ๐˜๐—ถ๐—ฐ๐—ฒ โœ๏ธ

If you are serious about growing your excelling in data science, you have to put in the time to nurture your knowledge. This means that you need to spend less time watching mindless videos on TikTok and spend more time reading books and watching video lectures.

Join @datasciencefree for more

ENJOY LEARNING ๐Ÿ‘๐Ÿ‘
๐Ÿ‘4
Forwarded from Artificial Intelligence
๐—ฆ๐˜๐—ฟ๐˜‚๐—ด๐—ด๐—น๐—ถ๐—ป๐—ด ๐˜„๐—ถ๐˜๐—ต ๐—ฃ๐—ผ๐˜„๐—ฒ๐—ฟ ๐—•๐—œ? ๐—ง๐—ต๐—ถ๐˜€ ๐—–๐—ต๐—ฒ๐—ฎ๐˜ ๐—ฆ๐—ต๐—ฒ๐—ฒ๐˜ ๐—ถ๐˜€ ๐—ฌ๐—ผ๐˜‚๐—ฟ ๐—จ๐—น๐˜๐—ถ๐—บ๐—ฎ๐˜๐—ฒ ๐—ฆ๐—ต๐—ผ๐—ฟ๐˜๐—ฐ๐˜‚๐˜!๐Ÿ˜

Mastering Power BI can be overwhelming, but this cheat sheet by DataCamp makes it super easy! ๐Ÿš€

๐‹๐ข๐ง๐ค๐Ÿ‘‡:-

https://pdlink.in/4ld6F7Y

No more flipping through tabs & tutorialsโ€”just pin this cheat sheet and analyze data like a pro!โœ…๏ธ
๐Ÿ‘1
NumPy_SciPy_Pandas_Quandl_Cheat_Sheet.pdf
134.6 KB
Cheatsheet on Numpy and pandas for easy viewing ๐Ÿ‘€
ibm_machine_learning_for_dummies.pdf
1.8 MB
Short Machine Learning guide on industry applications and how itโ€™s used to resolve problems ๐Ÿ’ก
1663243982009.pdf
349.9 KB
All SQL solutions for leetcode, good luck grinding ๐Ÿซฃ
git-cheat-sheet-education.pdf
97.8 KB
Git commands cheatsheets for anyone working on personal projects on GitHub! ๐Ÿ‘พ
1655183344172.pdf
333.8 KB
Algorithmic concepts for anyone who is taking Data Structure and Algorithms, or interested in algorithmic trading ๐Ÿ˜‰
๐Ÿ‘5โค2
๐Ÿญ๐Ÿฌ๐Ÿฌ% ๐—™๐—ฅ๐—˜๐—˜ ๐—–๐—ฒ๐—ฟ๐˜๐—ถ๐—ณ๐—ถ๐—ฐ๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐—–๐—ผ๐˜‚๐—ฟ๐˜€๐—ฒ๐˜€๐Ÿ˜

Master Python, Machine Learning, SQL, and Data Visualization with hands-on tutorials & real-world datasets? ๐ŸŽฏ

This 100% FREE resource from Kaggle will help you build job-ready skillsโ€”no fluff, no fees, just pure learning!

๐‹๐ข๐ง๐ค๐Ÿ‘‡:-

https://pdlink.in/3XYAnDy

Perfect for Beginners โœ…๏ธ
Here are five of the most commonly used SQL queries in data science:

1. SELECT and FROM Clauses
- Basic data retrieval: SELECT column1, column2 FROM table_name;

2. WHERE Clause
- Filtering data: SELECT * FROM table_name WHERE condition;

3. GROUP BY and Aggregate Functions
- Summarizing data: SELECT column1, COUNT(*), AVG(column2) FROM table_name GROUP BY column1;

4. JOIN Operations
- Combining data from multiple tables:

     SELECT a.column1, b.column2
FROM table1 a
JOIN table2 b ON a.common_column = b.common_column;

5. Subqueries and Nested Queries
- Advanced data retrieval:

     SELECT column1
FROM table_name
WHERE column2 IN (SELECT column2 FROM another_table WHERE condition);

Like for more โค๏ธ

Hope it helps :)
๐Ÿ‘2
๐—ง๐—ผ๐—ฝ ๐—ฐ๐—ผ๐—บ๐—ฝ๐—ฎ๐—ป๐—ถ๐—ฒ๐˜€ ๐—ข๐—ณ๐—ณ๐—ฒ๐—ฟ๐—ถ๐—ป๐—ด ๐—™๐—ฅ๐—˜๐—˜ ๐˜ƒ๐—ถ๐—ฟ๐˜๐˜‚๐—ฎ๐—น ๐—ฒ๐˜…๐—ฝ๐—ฒ๐—ฟ๐—ถ๐—ฒ๐—ป๐—ฐ๐—ฒ ๐—ฝ๐—ฟ๐—ผ๐—ด๐—ฟ๐—ฎ๐—บ๐˜€๐Ÿ˜

Want to work on real industry tasks, develop in-demand skills, and boost your resumeโ€”all for FREE? 

 Your dream career starts with real experienceโ€”grab this opportunity today!

๐‹๐ข๐ง๐ค๐Ÿ‘‡:-

https://pdlink.in/4bCyUIM

๐Ÿ’ก No experience requiredโ€”just learn, upskill & build your portfolio! ๐Ÿš€
Free Datasets to work on Power BI + SQL projects ๐Ÿ‘‡๐Ÿ‘‡

1. AdventureWorks Sample Database:
- Link: [AdventureWorks Sample Database](https://docs.microsoft.com/en-us/sql/samples/adventureworks-install-configure?view=sql-server-ver15)
- Description: A sample database provided by Microsoft, containing sales, products, customers, and other related data.

2. Online Retail Dataset:
- Link: [UCI Machine Learning Repository - Online Retail Dataset](https://archive.ics.uci.edu/ml/datasets/online+retail)
- Description: Transactional data from an online retail store, suitable for customer segmentation and sales analysis.

3. Supermarket Sales Dataset:
- Link: [Supermarket Sales Dataset](https://www.kaggle.com/aungpyaeap/supermarket-sales)
- Description: Sales data from a supermarket, useful for inventory management and sales performance analysis.

4. Yahoo Finance (Historical Stock Data):
- Link: [Yahoo Finance](https://finance.yahoo.com/)
- Description: Historical stock data for various companies, suitable for financial analysis and visualization.

5. Human Resources Analytics: Employee Attrition and Performance:
- Link: [Kaggle HR Analytics Dataset](https://www.kaggle.com/pavansubhasht/ibm-hr-analytics-attrition-dataset)
- Description: Employee data including demographics, performance, and attrition information, suitable for employee performance analysis.

Bonus Open Sources Resources: https://t.iss.one/DataPortfolio/16

These datasets are freely available for practicing Power BI and SQL skills. You can download them from the provided links and import them into your SQL database management system (e.g., MySQL, SQL Server, PostgreSQL) for hands-on โ˜บ๏ธ๐Ÿ’ช
๐Ÿ‘3
Best 5 data analyst projects for freshers with free certification
๐Ÿ‘‡๐Ÿ‘‡
https://datasimplifier.com/best-data-analyst-projects-for-freshers/
๐Ÿ‘1
Forwarded from Generative AI
๐Ÿฑ ๐—™๐—ฅ๐—˜๐—˜ ๐——๐—ฎ๐˜๐—ฎ ๐—”๐—ป๐—ฎ๐—น๐˜†๐˜๐—ถ๐—ฐ๐˜€ ๐—–๐—ฒ๐—ฟ๐˜๐—ถ๐—ณ๐—ถ๐—ฐ๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐—–๐—ผ๐˜‚๐—ฟ๐˜€๐—ฒ๐˜€ ๐Ÿ˜

Whether youโ€™re a complete beginner or looking to level up, these courses cover Excel, Power BI, Data Science, and Real-World Analytics Projects to make you job-ready.

๐‹๐ข๐ง๐ค๐Ÿ‘‡:-

https://pdlink.in/3DPkrga

All The Best ๐ŸŽŠ
๐Ÿ‘1
Free Datasets to work on Power BI + SQL projects ๐Ÿ‘‡๐Ÿ‘‡

1. AdventureWorks Sample Database:
- Link: [AdventureWorks Sample Database](https://docs.microsoft.com/en-us/sql/samples/adventureworks-install-configure?view=sql-server-ver15)
- Description: A sample database provided by Microsoft, containing sales, products, customers, and other related data.

2. Online Retail Dataset:
- Link: [UCI Machine Learning Repository - Online Retail Dataset](https://archive.ics.uci.edu/ml/datasets/online+retail)
- Description: Transactional data from an online retail store, suitable for customer segmentation and sales analysis.

3. Supermarket Sales Dataset:
- Link: [Supermarket Sales Dataset](https://www.kaggle.com/aungpyaeap/supermarket-sales)
- Description: Sales data from a supermarket, useful for inventory management and sales performance analysis.

4. Yahoo Finance (Historical Stock Data):
- Link: [Yahoo Finance](https://finance.yahoo.com/)
- Description: Historical stock data for various companies, suitable for financial analysis and visualization.

5. Human Resources Analytics: Employee Attrition and Performance:
- Link: [Kaggle HR Analytics Dataset](https://www.kaggle.com/pavansubhasht/ibm-hr-analytics-attrition-dataset)
- Description: Employee data including demographics, performance, and attrition information, suitable for employee performance analysis.

Bonus Open Sources Resources: https://t.iss.one/DataPortfolio/16

These datasets are freely available for practicing Power BI and SQL skills. You can download them from the provided links and import them into your SQL database management system (e.g., MySQL, SQL Server, PostgreSQL) for hands-on โ˜บ๏ธ๐Ÿ’ช
๐Ÿ‘3
๐Ÿฑ ๐—™๐—ฟ๐—ฒ๐—ฒ ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป๐—ถ๐—ป๐—ด ๐—ฃ๐—น๐—ฎ๐—ป๐˜€ ๐˜๐—ผ ๐—จ๐—ฝ๐˜€๐—ธ๐—ถ๐—น๐—น ๐—ถ๐—ป ๐—ง๐—ฒ๐—ฐ๐—ต & ๐—”๐—œ!๐Ÿ˜

Looking to boost your tech career?๐Ÿš€

These free learning plans will help you stay ahead in DevOps, AI, Cloud Security, Data Analytics, and Machine Learning!๐Ÿ“Š

๐‹๐ข๐ง๐ค๐Ÿ‘‡:-

https://pdlink.in/4ijtDI2

Perfect for Beginners & Professionals Looking to Upskill!โœ…๏ธ
๐Ÿ‘1
Here are two amazing SQL Projects for data analytics ๐Ÿ‘‡๐Ÿ‘‡

Calculating Free-to-Paid Conversion Rate with SQL Project

Career Track Analysis with SQL and Tableau Project

Like this post if you need more data analytics projects in the channel ๐Ÿ˜„

Hope it helps :)
๐Ÿ‘7
๐ŸŽ“ ๐—™๐—ฟ๐—ฒ๐—ฒ ๐—–๐—ผ๐˜‚๐—ฟ๐˜€๐—ฒ๐˜€ ๐—ณ๐—ฟ๐—ผ๐—บ ๐—ข๐—ฝ๐—ฒ๐—ป ๐—จ๐—ป๐—ถ๐˜ƒ๐—ฒ๐—ฟ๐˜€๐—ถ๐˜๐˜† โ€“ ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป, ๐—š๐—ฟ๐—ผ๐˜„ & ๐—จ๐—ฝ๐˜€๐—ธ๐—ถ๐—น๐—น!๐Ÿ˜

If youโ€™re just starting your learning journey or looking to level up your skillsโ€”this is your golden opportunity! ๐ŸŒŸ

๐‹๐ข๐ง๐ค๐Ÿ‘‡:-

https://pdlink.in/4cuo73X

โณ Donโ€™t miss outโ€”bookmark this for later!
๐Ÿ‘3
๐Ÿš€ Key Skills for Aspiring Tech Specialists

๐Ÿ“Š Data Analyst:
- Proficiency in SQL for database querying
- Advanced Excel for data manipulation
- Programming with Python or R for data analysis
- Statistical analysis to understand data trends
- Data visualization tools like Tableau or PowerBI
- Data preprocessing to clean and structure data
- Exploratory data analysis techniques

๐Ÿง  Data Scientist:
- Strong knowledge of Python and R for statistical analysis
- Machine learning for predictive modeling
- Deep understanding of mathematics and statistics
- Data wrangling to prepare data for analysis
- Big data platforms like Hadoop or Spark
- Data visualization and communication skills
- Experience with A/B testing frameworks

๐Ÿ— Data Engineer:
- Expertise in SQL and NoSQL databases
- Experience with data warehousing solutions
- ETL (Extract, Transform, Load) process knowledge
- Familiarity with big data tools (e.g., Apache Spark)
- Proficient in Python, Java, or Scala
- Knowledge of cloud services like AWS, GCP, or Azure
- Understanding of data pipeline and workflow management tools

๐Ÿค– Machine Learning Engineer:
- Proficiency in Python and libraries like scikit-learn, TensorFlow
- Solid understanding of machine learning algorithms
- Experience with neural networks and deep learning frameworks
- Ability to implement models and fine-tune their parameters
- Knowledge of software engineering best practices
- Data modeling and evaluation strategies
- Strong mathematical skills, particularly in linear algebra and calculus

๐Ÿง  Deep Learning Engineer:
- Expertise in deep learning frameworks like TensorFlow or PyTorch
- Understanding of Convolutional and Recurrent Neural Networks
- Experience with GPU computing and parallel processing
- Familiarity with computer vision and natural language processing
- Ability to handle large datasets and train complex models
- Research mindset to keep up with the latest developments in deep learning

๐Ÿคฏ AI Engineer:
- Solid foundation in algorithms, logic, and mathematics
- Proficiency in programming languages like Python or C++
- Experience with AI technologies including ML, neural networks, and cognitive computing
- Understanding of AI model deployment and scaling
- Knowledge of AI ethics and responsible AI practices
- Strong problem-solving and analytical skills

๐Ÿ”Š NLP Engineer:
- Background in linguistics and language models
- Proficiency with NLP libraries (e.g., NLTK, spaCy)
- Experience with text preprocessing and tokenization
- Understanding of sentiment analysis, text classification, and named entity recognition
- Familiarity with transformer models like BERT and GPT
- Ability to work with large text datasets and sequential data
๐Ÿ‘8โค1