Machine Learning & Artificial Intelligence | Data Science Free Courses
64.3K subscribers
557 photos
2 videos
98 files
425 links
Perfect channel to learn Data Analytics, Data Sciene, Machine Learning & Artificial Intelligence

Admin: @coderfun
Download Telegram
The Data Science skill no one talks about...

Every aspiring data scientist I talk to thinks their job starts when someone else gives them:
    1. a dataset, and
    2. a clearly defined metric to optimize for, e.g. accuracy

But it doesnโ€™t.

It starts with a business problem you need to understand, frame, and solve. This is the key data science skill that separates senior from junior professionals.

Letโ€™s go through an example.

Example

Imagine you are a data scientist at Uber. And your product lead tells you:

    ๐Ÿ‘ฉโ€๐Ÿ’ผ: โ€œWe want to decrease user churn by 5% this quarterโ€


We say that a user churns when she decides to stop using Uber.

But why?

There are different reasons why a user would stop using Uber. For example:

   1.  โ€œLyft is offering better prices for that geoโ€ (pricing problem)
   2. โ€œCar waiting times are too longโ€ (supply problem)
   3. โ€œThe Android version of the app is very slowโ€ (client-app performance problem)

You build this list โ†‘ by asking the right questions to the rest of the team. You need to understand the userโ€™s experience using the app, from HER point of view.

Typically there is no single reason behind churn, but a combination of a few of these. The question is: which one should you focus on?

This is when you pull out your great data science skills and EXPLORE THE DATA ๐Ÿ”Ž.

You explore the data to understand how plausible each of the above explanations is. The output from this analysis is a single hypothesis you should consider further. Depending on the hypothesis, you will solve the data science problem differently.

For exampleโ€ฆ

Scenario 1: โ€œLyft Is Offering Better Pricesโ€ (Pricing Problem)

One solution would be to detect/predict the segment of users who are likely to churn (possibly using an ML Model) and send personalized discounts via push notifications. To test your solution works, you will need to run an A/B test, so you will split a percentage of Uber users into 2 groups:

    The A group. No user in this group will receive any discount.

    The B group. Users from this group that the model thinks are likely to churn, will receive a price discount in their next trip.

You could add more groups (e.g. C, D, Eโ€ฆ) to test different pricing points.

In a nutshell

    1. Translating business problems into data science problems is the key data science skill that separates a senior from a junior data scientist.
2. Ask the right questions, list possible solutions, and explore the data to narrow down the list to one.
3. Solve this one data science problem
๐Ÿ‘28๐Ÿฅฐ1
1. How can we deal with problems that arise when the data flows in from a variety of sources?

There are many ways to go about dealing with multi-source problems. However, these are done primarily to solve the problems of:

Identifying the presence of similar/same records and merging them into a single recordRe-structuring the schema to ensure there is good schema integration



2. Where is Time Series Analysis used?

Since time series analysis (TSA) has a wide scope of usage, it can be used in multiple domains. Here are some of the places where TSA plays an important role:

Statistics
Signal processing
Econometrics
Weather forecasting
Earthquake prediction
Astronomy
Applied science


3. What are the ideal situations in which t-test or z-test can be used?

It is a standard practice that a t-test is used when there is a sample size less than 30 and the z-test is considered when the sample size exceeds 30 in most cases.


4. What is the usage of the NVL() function?

The NVL() function is used to convert the NULL value to the other value. The function returns the value of the second parameter if the first parameter is NULL. If the first parameter is anything other than NULL, it is left unchanged. This function is used in Oracle, not in SQL and MySQL. Instead of NVL() function, MySQL have IFNULL() and SQL Server have ISNULL() function.


5. What is the difference between DROP and TRUNCATE commands?

If a table is dropped, all things associated with that table are dropped as well. This includes the relationships defined on the table with other tables, access privileges, and grants that the table has, as well as the integrity checks and constraints.

However, if a table is truncated, there are no such problems as mentioned above. The table retains its original structure and the data is dropped.
๐Ÿ‘8๐Ÿ‘Ž1
Learn Data Science in 2024

๐Ÿญ. ๐—”๐—ฝ๐—ฝ๐—น๐˜† ๐—ฃ๐—ฎ๐—ฟ๐—ฒ๐˜๐—ผ'๐˜€ ๐—Ÿ๐—ฎ๐˜„ ๐˜๐—ผ ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป ๐—๐˜‚๐˜€๐˜ ๐—˜๐—ป๐—ผ๐˜‚๐—ด๐—ต ๐Ÿ“š

Pareto's Law states that "that 80% of consequences come from 20% of the causes".

This law should serve as a guiding framework for the volume of content you need to know to be proficient in data science.

Often rookies make the mistake of overspending their time learning algorithms that are rarely applied in production. Learning about advanced algorithms such as XLNet, Bayesian SVD++, and BiLSTMs, are cool to learn.

But, in reality, you will rarely apply such algorithms in production (unless your job demands research and application of state-of-the-art algos).

For most ML applications in production - especially in the MVP phase, simple algos like logistic regression, K-Means, random forest, and XGBoost provide the biggest bang for the buck because of their simplicity in training, interpretation and productionization.

So, invest more time learning topics that provide immediate value now, not a year later.

๐Ÿฎ. ๐—™๐—ถ๐—ป๐—ฑ ๐—ฎ ๐— ๐—ฒ๐—ป๐˜๐—ผ๐—ฟ โšก

Thereโ€™s a Japanese proverb that says โ€œBetter than a thousand days of diligent study is one day with a great teacher.โ€ This proverb directly applies to learning data science quickly.

Mentors can teach you about how to build a model in production and how to manage stakeholders - stuff that you donโ€™t often read about in courses and books.

So, find a mentor who can teach you practical knowledge in data science.

๐Ÿฏ. ๐——๐—ฒ๐—น๐—ถ๐—ฏ๐—ฒ๐—ฟ๐—ฎ๐˜๐—ฒ ๐—ฃ๐—ฟ๐—ฎ๐—ฐ๐˜๐—ถ๐—ฐ๐—ฒ โœ๏ธ

If you are serious about growing your excelling in data science, you have to put in the time to nurture your knowledge. This means that you need to spend less time watching mindless videos on TikTok and spend more time reading books and watching video lectures.

Join @datasciencefree for more

ENJOY LEARNING ๐Ÿ‘๐Ÿ‘
๐Ÿ‘20
๐ŸŽฏ5 Certification from Data Science :

๐Ÿ“Python free certification :
https://imp.i115008.net/5bK93j

๐Ÿ“SQL Course :
https://bit.ly/3FxxKPz

๐Ÿ“Data Science Certification :
https://365datascience.pxf.io/q4m66g

๐Ÿ“Data Analysis :
https://imp.i115008.net/gb6ZJ2

Hope this was helpful for you
๐Ÿ‘13๐Ÿ‘Ž3โค1
Optimize your resume to get more interviews

Many job seekers donโ€™t get enough interviews even after applying for dozens of jobs. Why? Companies use Applicant Tracking Systems (ATS) to search and filter resumes by keywords. The Jobscan resume scanner helps you optimize your resume keywords for each job listing so that your application gets found by recruiters.

Link -> https://jobscanco.pxf.io/KjGgAa

ENJOY LEARNING ๐Ÿ‘๐Ÿ‘
๐Ÿ‘11โค1
Machine Learning types
๐Ÿ‘26โค2
Famous ML Project Ideas
๐Ÿ‘‡๐Ÿ‘‡
https://t.iss.one/pythonspecialist/103
๐Ÿ‘10๐Ÿ˜1
BCG Hiring ML Engineer
๐Ÿ‘‡๐Ÿ‘‡
https://t.iss.one/getjobss/1851

Requirements:
Very high proficiency in Python programming language, knowledge of other languages.
such as R, Java would be a plus.
Knowledge of various AI/ML models including deep learning models.
Knowledge of Generative AI stack โ€“ Large Language Models / Foundation Models, vector databases, orchestration stack.
Hand on experience in building AI orchestration with frameworks like LangChain.
Knowledge of vector databases e.g., Pinecone, Chroma etc.
Deep understanding of data processing frameworks e.g., Data Bricks, Airflow etc.
Knowledge of API frameworks Django, Flask etc.
Understanding of cloud data & AI stack on AWS / Azure / GCP is preferred.

ENJOY LEARNING ๐Ÿ‘๐Ÿ‘
๐Ÿ‘9โค3
What if we all are just a part of AI experiment by god- humanโ€™s life created as a unique dataset, contributing to the overall learning process. Creator contemplates the diversity of experiences encoded in the training data, like the complex interplay of joy, sorrow, love, hatred and conflict.

Read more.....
๐Ÿ‘15
Preparing for a data science interview can be challenging, but with the right approach, you can increase your chances of success. Here are some tips to help you prepare for your next data science interview:

๐Ÿ‘‰ 1. Review the Fundamentals: Make sure you have a thorough understanding of the fundamentals of statistics, probability, and linear algebra. You should also be familiar with data structures, algorithms, and programming languages like Python, R, and SQL.

๐Ÿ‘‰ 2. Brush up on Machine Learning: Machine learning is a key aspect of data science. Make sure you have a solid understanding of different types of machine learning algorithms like supervised, unsupervised, and reinforcement learning.

๐Ÿ‘‰ 3. Practice Coding: Practice coding questions related to data structures, algorithms, and data science problems. You can use online resources like HackerRank, LeetCode, and Kaggle to practice.

๐Ÿ‘‰ 4. Build a Portfolio: Create a portfolio of projects that demonstrate your data science skills. This can include data cleaning, data wrangling, exploratory data analysis, and machine learning projects.

๐Ÿ‘‰ 5. Practice Communication: Data scientists are expected to effectively communicate complex technical concepts to non-technical stakeholders. Practice explaining your projects and technical concepts in simple terms.

๐Ÿ‘‰ 6. Research the Company: Research the company you are interviewing with and their industry. Understand how they use data and what data science problems they are trying to solve.

By following these tips, you can be well-prepared for your next data science interview. Good luck!
๐Ÿ‘16โค2