Data Science & Machine Learning
73.1K subscribers
779 photos
2 videos
68 files
686 links
Join this channel to learn data science, artificial intelligence and machine learning with funny quizzes, interesting projects and amazing resources for free

For collaborations: @love_data
Download Telegram
Machine Learning Algorithm
โค11
๐’๐๐‹ ๐‚๐š๐ฌ๐ž ๐’๐ญ๐ฎ๐๐ข๐ž๐ฌ ๐Ÿ๐จ๐ซ ๐ˆ๐ง๐ญ๐ž๐ซ๐ฏ๐ข๐ž๐ฐ:

Join for more: https://t.iss.one/sqlanalyst

1. Dannyโ€™s Diner:
Restaurant analytics to understand the customer orders pattern.
Link: https://8weeksqlchallenge.com/case-study-1/

2. Pizza Runner
Pizza shop analytics to optimize the efficiency of the operation
Link: https://8weeksqlchallenge.com/case-study-2/

3. Foodie Fie
Subscription-based food content platform
Link: https://lnkd.in/gzB39qAT

4. Data Bank: Thatโ€™s money
Analytics based on customer activities with the digital bank
Link: https://lnkd.in/gH8pKPyv

5. Data Mart: Fresh is Best
Analytics on Online supermarket
Link: https://lnkd.in/gC5bkcDf

6. Clique Bait: Attention capturing
Analytics on the seafood industry
Link: https://lnkd.in/ggP4JiYG

7. Balanced Tree: Clothing Company
Analytics on the sales performance of clothing store
Link: https://8weeksqlchallenge.com/case-study-7

8. Fresh segments: Extract maximum value
Analytics on online advertising
Link: https://8weeksqlchallenge.com/case-study-8
โค4
Getting a job in 2017:

Apply, get interview, get offer, negotiate salary, start job.

Getting a job in 2025:

Find job you are overqualified for that is underpaying market rates, connect with current employees and ask for a recommendation, bake a cake for the potential team youโ€™ll be apart of and hope your efforts are better than other candidates, meet with the third cousin of the hiring manager to see if you are a good fit to maybe start the process of interviewing, take a 3-hour long pass
โค7
Cold email template for Freshers ๐Ÿ‘‡

Dear {NAME},

I hope this email finds you in good health and high spirits. I am writing to express my keen interest in the internship opportunity at the {NAME} and to submit my application for your consideration.


Allow me to introduce myself. My name is Ashok Aggarwal, and I am a statistics major with a specialization in Data Science. I have been following the remarkable work conducted by {NAME} and the valuable contributions it has made to the field of biomedical research and public health. I am truly inspired by the {One USP}


Having reviewed the internship description and requirements, I firmly believe that my academic background and skills make me a strong candidate for this opportunity. I have a solid foundation in statistics and data analysis, along with proficiency in relevant software such as Python, NumPy, Pandas, and visualization tools like Matplotlib. Furthermore, my prior project on {xyz} has reinforced my passion for utilizing data-driven insights to understand {XYZ}


Joining {name} for this internship would provide me with a tremendous platform to contribute my statistical expertise and collaborate with esteemed scientists like yourself. I am eager to work closely with the research team, assist in communications campaigns, engage in community programs, and learn from the collective expertise at {Name}.


I have attached my resume and would be grateful if you could review my application. I am available for an interview at your convenience to further discuss my qualifications and how I can contribute to {NAME} initiatives. I genuinely appreciate your time and consideration.


Thank you for your attention to my application. I look forward to the possibility of joining {NAME} and making a meaningful contribution to the organization's mission. Should you require any further information or documentation, please do not hesitate to contact me.

Wishing you a productive day ahead.


Sincerely,

{Full Name}
โค5
Data Science vs. Data Analytics
โค4
Handling Datasets of All Types โ€“ Part 1 of 5: Introduction and Basic Concepts โ˜‘๏ธ


1. What is a Dataset?

โ€ข A dataset is a structured collection of data, usually organized in rows and columns, used for analysis or training machine learning models.

2. Types of Datasets

โ€ข Structured Data: Tables, spreadsheets with rows and columns (e.g., CSV, Excel).

โ€ข Unstructured Data: Images, text, audio, video.

โ€ข Semi-structured Data: JSON, XML files containing hierarchical data.

3. Common Dataset Formats

โ€ข CSV (Comma-Separated Values)

โ€ข Excel (.xls, .xlsx)

โ€ข JSON (JavaScript Object Notation)

โ€ข XML (eXtensible Markup Language)

โ€ข Images (JPEG, PNG, TIFF)

โ€ข Audio (WAV, MP3)


4. Loading Datasets in Python

โ€ข Use libraries like pandas for structured data:

import pandas as pd
df = pd.read_csv('data.csv')


โ€ข Use libraries like json for JSON files:

import json
with open('data.json') as f:
    data = json.load(f)



5. Basic Dataset Exploration

โ€ข Check shape and size:

print(df.shape)


โ€ข Preview data:

print(df.head())


โ€ข Check for missing values:

print(df.isnull().sum())



6. Summary

โ€ข Understanding dataset types is crucial before processing.

โ€ข Loading and exploring datasets helps identify cleaning and preprocessing needs.


Exercise

โ€ข Load a CSV and JSON dataset in Python, print their shapes, and identify missing values.

Hope this helped you โœ”๏ธ
Please open Telegram to view this post
VIEW IN TELEGRAM
โค5๐Ÿ‘2
Convolutional Neural Network Cheat Sheet
โค1
What's the correct answer ๐Ÿ‘‡๐Ÿ‘‡
โค2
Data Science & Machine Learning
What's the correct answer ๐Ÿ‘‡๐Ÿ‘‡
a = "10" โ†’ Variable a is assigned the string "10".

b = a โ†’ Variable b also holds the string "10" (but it's not used afterward).

a = a * 2 โ†’ Since a is a string, multiplying it by an integer results in string repetition.

"10" * 2 results in "1010"

print(a) โ†’ prints the new value of a, which is "1010".


โœ… Correct answer: D. 1010
โค5
๐Ÿ”ฐ Python Question / Quiz

What is the output of the following Python code?
โค2
How much Statistics must I know to become a Data Scientist?

This is one of the most common questions

Here are the must-know Statistics concepts every Data Scientist should know:

๐—ฃ๐—ฟ๐—ผ๐—ฏ๐—ฎ๐—ฏ๐—ถ๐—น๐—ถ๐˜๐˜†

โ†—๏ธ Bayes' Theorem & conditional probability
โ†—๏ธ Permutations & combinations
โ†—๏ธ Card & die roll problem-solving

๐——๐—ฒ๐˜€๐—ฐ๐—ฟ๐—ถ๐—ฝ๐˜๐—ถ๐˜ƒ๐—ฒ ๐˜€๐˜๐—ฎ๐˜๐—ถ๐˜€๐˜๐—ถ๐—ฐ๐˜€ & ๐—ฑ๐—ถ๐˜€๐˜๐—ฟ๐—ถ๐—ฏ๐˜‚๐˜๐—ถ๐—ผ๐—ป๐˜€

โ†—๏ธ Mean, median, mode
โ†—๏ธ Standard deviation and variance
โ†—๏ธ  Bernoulli's, Binomial, Normal, Uniform, Exponential distributions

๐—œ๐—ป๐—ณ๐—ฒ๐—ฟ๐—ฒ๐—ป๐˜๐—ถ๐—ฎ๐—น ๐˜€๐˜๐—ฎ๐˜๐—ถ๐˜€๐˜๐—ถ๐—ฐ๐˜€

โ†—๏ธ A/B experimentation
โ†—๏ธ T-test, Z-test, Chi-squared tests
โ†—๏ธ Type 1 & 2 errors
โ†—๏ธ Sampling techniques & biases
โ†—๏ธ Confidence intervals & p-values
โ†—๏ธ Central Limit Theorem
โ†—๏ธ Causal inference techniques

๐— ๐—ฎ๐—ฐ๐—ต๐—ถ๐—ป๐—ฒ ๐—น๐—ฒ๐—ฎ๐—ฟ๐—ป๐—ถ๐—ป๐—ด

โ†—๏ธ Logistic & Linear regression
โ†—๏ธ Decision trees & random forests
โ†—๏ธ Clustering models
โ†—๏ธ Feature engineering
โ†—๏ธ Feature selection methods
โ†—๏ธ Model testing & validation
โ†—๏ธ Time series analysis

Math & Statistics: https://whatsapp.com/channel/0029Vat3Dc4KAwEcfFbNnZ3O
โค5๐Ÿ‘1
Random Module in Python ๐Ÿ‘†
๐Ÿ‘5โค4