PyData Careers
20.8K subscribers
206 photos
4 videos
26 files
352 links
Python Data Science jobs, interview tips, and career insights for aspiring professionals.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
❔ Interview question

Why are int and bool classes, and not "primitive types" as in other languages?

Answer: Because in Python everything is based on the object model. int, bool, str, and others are built-in classes, and each time you use them you create their instances. For example, 5 is an object of the int class.

Even the classes themselves, like int, are also objects. They are created using a special object called type, which is the default metaclass. Therefore, type(int) returns type.


tags: #interview
Please open Telegram to view this post
VIEW IN TELEGRAM
❀7
Forwarded from Code With Python
Python.pdf
488 KB
πŸ³οΈβ€πŸŒˆ Notes "Mastering Python"
βœ… From Basic to Advanced

πŸ‘¨πŸ»β€πŸ’» An excellent note that teaches everything from basic concepts to building professional projects with Python.

⭕️ Basic concepts like variables, data types, and control flow

⏺ Functions, modules, and writing reusable code

⭕️ Data structures like lists, dictionaries, sets, and tuples

⏺ Object-oriented programming: classes, inheritance, and polymorphism

⭕️ Working with files, error handling, and debugging

⬅️ Alongside, with practical projects like data analysis, web scraping, and working with APIs, you learn how to apply Python in the real world.

🌐 #Data_Science #DataScience
βž–βž–βž–βž–βž–βž–βž–βž–βž–βž–βž–βž–βž–
Please open Telegram to view this post
VIEW IN TELEGRAM
❀2πŸ”₯1
❓ Interview question

Why are strings immutable in Python?

Answer: Because once a string is created, its value cannot be changed. This ensures that strings can be safely used in hashing, as dictionary keys, and in multithreaded environments without causing data corruption. Instead of modifying the string, Python creates a new string object when any operation changes it. This behavior improves performance and security but requires more memory for frequent modifications.

tags: #interview #python #strings

By: t.iss.one/DataScienceQ πŸš€
❀1
❓ Interview question

What is a generator in Python?

Answer: A generator is a special type of function that returns an iterator using the `yield` keyword instead of `return`. It allows you to generate a sequence of values lazily, meaning values are produced on-demand rather than all at once. This saves memory and improves performance when dealing with large datasets.

tags: #python #interview #generator #iterator #programming

By: t.iss.one/DataScienceQ πŸš€
❓ Interview question :
What is the Transformer architecture, and why is it considered a breakthrough in NLP?

❓ Interview question :
How does self-attention enable Transformers to capture long-range dependencies in text?

❓ Interview question :
What are the main components of a Transformer model?

❓ Interview question :
Why are positional encodings essential in Transformers?

❓ Interview question :
How does multi-head attention improve Transformer performance compared to single-head attention?

❓ Interview question :
What is the purpose of feed-forward networks in the Transformer architecture?

❓ Interview question :
How do residual connections and layer normalization contribute to training stability in Transformers?

❓ Interview question :
What is the difference between encoder and decoder in the Transformer model?

❓ Interview question :
Why can Transformers process sequences in parallel, unlike RNNs?

❓ Interview question :
How does masked self-attention work in the decoder of a Transformer?

❓ Interview question :
What is the role of key, query, and value in attention mechanisms?

❓ Interview question :
How do attention weights determine which parts of input are most relevant?

❓ Interview question :
What are the advantages of using scaled dot-product attention in Transformers?

❓ Interview question :
How does position-wise feed-forward network differ from attention layers in Transformers?

❓ Interview question :
Why is pre-training important for large Transformer models like BERT and GPT?

❓ Interview question :
How do fine-tuning and transfer learning benefit Transformer-based models?

❓ Interview question :
What are the limitations of Transformers in terms of computational cost and memory usage?

❓ Interview question :
How do sparse attention and linear attention address scalability issues in Transformers?

❓ Interview question :
What is the significance of model size (e.g., number of parameters) in Transformer performance?

❓ Interview question :
How do attention heads in multi-head attention capture different types of relationships in data?

#️⃣ tags: #Transformer #NLP #DeepLearning #SelfAttention #MultiHeadAttention #PositionalEncoding #FeedForwardNetwork #EncoderDecoder

By: t.iss.one/DataScienceQ πŸš€
❀2
❓ Interview question :
What is NumPy, and why is it essential for scientific computing in Python?

❓ Interview question :
How do arrays in NumPy differ from Python lists?

❓ Interview question :
What is the purpose of ndarray in NumPy?

❓ Interview question :
How can you create a 2D array using NumPy?

❓ Interview question :
What does shape represent in a NumPy array?

❓ Interview question :
How do you perform element-wise operations on NumPy arrays?

❓ Interview question :
What is broadcasting in NumPy, and how does it work?

❓ Interview question :
How do you reshape a NumPy array using reshape()?

❓ Interview question :
What is the difference between copy() and view() in NumPy?

❓ Interview question :
How do you concatenate two NumPy arrays along a specific axis?

❓ Interview question :
What is the role of axis parameter in NumPy functions like sum(), mean(), etc.?

❓ Interview question :
How do you find the maximum and minimum values in a NumPy array?

❓ Interview question :
What are ufuncs in NumPy, and give an example?

❓ Interview question :
How do you sort a NumPy array using np.sort()?

❓ Interview question :
What is the use of np.where() in conditional indexing?

❓ Interview question :
How do you generate random numbers using NumPy?

❓ Interview question :
What is the difference between np.random.rand() and np.random.randn()?

❓ Interview question :
How do you load data from a file into a NumPy array?

❓ Interview question :
What is vectorization in NumPy, and why is it important?

❓ Interview question :
How do you calculate the dot product of two arrays in NumPy?

#️⃣ tags: #NumPy #Python #ScientificComputing #Array #ndarray #ElementWiseOperations #Broadcasting #Reshape #CopyView #Concatenation #AxisParameter #MaximumMinimum #ufuncs #Sorting #ConditionalIndexing #RandomNumbers #DataLoading #Vectorization #DotProduct

By: t.iss.one/DataScienceQ πŸš€
1. What is the primary data structure in pandas?
2. How do you create a DataFrame from a dictionary?
3. Which method is used to read a CSV file in pandas?
4. What does the head() function do in pandas?
5. How can you check the data types of columns in a DataFrame?
6. Which function drops rows with missing values in pandas?
7. What is the purpose of the merge() function in pandas?
8. How do you filter rows based on a condition in pandas?
9. What does the groupby() method do?
10. How can you sort a DataFrame by a specific column?
11. Which method is used to rename columns in pandas?
12. What is the difference between loc and iloc in pandas?
13. How do you handle duplicate rows in pandas?
14. What function converts a column to datetime format?
15. How do you apply a custom function to a DataFrame?
16. What is the use of the apply() method in pandas?
17. How can you concatenate two DataFrames?
18. What does the pivot_table() function do?
19. How do you calculate summary statistics in pandas?
20. Which method is used to export a DataFrame to a CSV file?

#️⃣ #pandas #dataanalysis #python #dataframe #coding #programming #datascience

By: t.iss.one/DataScienceQ πŸš€
❀1
1. What is the primary purpose of PHP?
2. How do you declare a variable in PHP?
3. Which symbol starts a PHP code block?
4. What is the difference between echo and print in PHP?
5. How do you create an array in PHP?
6. Which function is used to get the length of a string in PHP?
7. What is the use of the isset() function in PHP?
8. How do you handle form data in PHP?
9. What does the $_GET superglobal contain?
10. How can you include another PHP file in your script?
11. What is the purpose of the require_once statement?
12. How do you define a function in PHP?
13. What is the difference between == and === in PHP?
14. How do you connect to a MySQL database using PHP?
15. Which function executes a SQL query in PHP?
16. What is the use of the mysqli_fetch_assoc() function?
17. How do you start a session in PHP?
18. What is the purpose of the session_start() function?
19. How do you redirect a user to another page in PHP?
20. What is the use of the header() function in PHP?

#️⃣ #php #webdevelopment #coding #programming #backend #scripting #serverside #dev

By: t.iss.one/DataScienceQ πŸš€
Please open Telegram to view this post
VIEW IN TELEGRAM
❔ Interview question

What is the output of the following code?
x = [1, 2, 3]
y = x
y.append(4)
print(x)

Answer:
[1, 2, 3, 4]

tags: #python #interview #coding #programming #datastructures #list #mutable #dev

By: t.iss.one/DataScienceQ πŸš€
❔ Interview question

**What will be the output of this code?**
```python
x = [1, 2, 3]
y = x[:]
y[0] = 10
print(x)
```

Answer:
<spoiler>||[1, 2, 3]||</spoiler>

tags: #python #interview #coding #programming #list #slicing #mutable #dev

By: t.iss.one/DataScienceQ πŸš€
❔ Interview question

What is the output of the following code?
def my_func():
return "hello", "world"

result = my_func()
print(type(result))

Answer:
<class 'tuple'>

tags: #python #interview #coding #programming #function #returnvalues #tuple #dev

By: t.iss.one/DataScienceQ πŸš€
❔ Interview question

What does the following code do?
import os
os.makedirs("folder/subfolder", exist_ok=True)

Answer:
Creates a directory named 'folder' with a subdirectory 'subfolder' if it doesn't already exist

tags: #python #os #filehandling #coding #programming #directory #makedirs #dev

By: t.iss.one/DataScienceQ πŸš€
❀1
KMeans Interview Questions

❓ What is the primary goal of KMeans clustering?

Answer:
To partition data into K clusters based on similarity, minimizing intra-cluster variance

❓ How does KMeans determine the initial cluster centers?

Answer:
By randomly selecting K data points as initial centroids

❓ What is the main limitation of KMeans regarding cluster shape?

Answer:
It assumes spherical and equally sized clusters, struggling with non-spherical shapes

❓ How do you choose the optimal number of clusters (K) in KMeans?

Answer:
Using methods like the Elbow Method or Silhouette Score

❓ What is the role of the inertia metric in KMeans?

Answer:
Measures the sum of squared distances from each point to its cluster center

❓ Can KMeans handle categorical data directly?

Answer:
No, it requires numerical data; categorical variables must be encoded

❓ How does KMeans handle outliers?

Answer:
Outliers can distort cluster centers and increase inertia

❓ What is the difference between KMeans and KMedoids?

Answer:
KMeans uses mean of points, while KMedoids uses actual data points as centers

❓ Why is feature scaling important for KMeans?

Answer:
To ensure all features contribute equally and prevent dominance by large-scale features

❓ How does KMeans work in high-dimensional spaces?

Answer:
It suffers from the curse of dimensionality, making distance measures less meaningful

❓ What is the time complexity of KMeans?

Answer:
O(n * k * t), where n is samples, k is clusters, and t is iterations

❓ What is the space complexity of KMeans?

Answer:
O(k * d), where k is clusters and d is features

❓ How do you evaluate the quality of KMeans clustering?

Answer:
Using metrics like silhouette score, within-cluster sum of squares, or Davies-Bouldin index

❓ Can KMeans be used for image segmentation?

Answer:
Yes, by treating pixel values as features and clustering them

❓ How does KMeans initialize centroids differently in KMeans++?

Answer:
Centroids are initialized to be far apart, improving convergence speed and quality

❓ What happens if the number of clusters (K) is too small?

Answer:
Clusters may be overly broad, merging distinct groups

❓ What happens if the number of clusters (K) is too large?

Answer:
Overfitting occurs, creating artificial clusters

❓ Does KMeans guarantee a global optimum?

Answer:
No, it converges to a local optimum depending on initialization

❓ How can you improve KMeans performance on large datasets?

Answer:
Using MiniBatchKMeans or sampling techniques

❓ What is the effect of random seed on KMeans results?

Answer:
Different seeds lead to different initial centroids, affecting final clusters

#️⃣ #kmeans #machine_learning #clustering #data_science #ai #python #coding #dev

By: t.iss.one/DataScienceQ πŸš€
Genetic Algorithms Interview Questions

❓ What is the primary goal of Genetic Algorithms (GA)?

Answer:
To find optimal or near-optimal solutions to complex optimization problems using principles of natural selection

❓ How does a Genetic Algorithm mimic biological evolution?

Answer:
By using selection, crossover, and mutation to evolve a population of solutions over generations

❓ What is a chromosome in Genetic Algorithms?

Answer:
A representation of a potential solution encoded as a string of genes

❓ What is the role of the fitness function in GA?

Answer:
To evaluate how good a solution is and guide the selection process

❓ How does selection work in Genetic Algorithms?

Answer:
Better-performing individuals are more likely to be chosen for reproduction

❓ What is crossover in Genetic Algorithms?

Answer:
Combining parts of two parent chromosomes to create offspring

❓ What is the purpose of mutation in GA?

Answer:
Introducing small random changes to maintain diversity and avoid local optima

❓ Why is elitism used in Genetic Algorithms?

Answer:
To preserve the best solutions from one generation to the next

❓ What is the difference between selection and reproduction in GA?

Answer:
Selection chooses which individuals will reproduce; reproduction creates new offspring

❓ How do you represent real-valued variables in a Genetic Algorithm?

Answer:
Using floating-point encoding or binary encoding with appropriate decoding

❓ What is the main advantage of Genetic Algorithms?

Answer:
They can solve complex, non-linear, and multi-modal optimization problems without requiring derivatives

❓ What is the main disadvantage of Genetic Algorithms?

Answer:
They can be computationally expensive and may converge slowly

❓ Can Genetic Algorithms guarantee an optimal solution?

Answer:
No, they provide approximate solutions, not guaranteed optimality

❓ How do you prevent premature convergence in GA?

Answer:
Using techniques like adaptive mutation rates or niching

❓ What is the role of population size in Genetic Algorithms?

Answer:
Larger populations increase diversity but also increase computation time

❓ How does crossover probability affect GA performance?

Answer:
Higher values increase genetic mixing, but too high may disrupt good solutions

❓ What is the effect of mutation probability on GA?

Answer:
Too low reduces exploration; too high turns GA into random search

❓ Can Genetic Algorithms be used for feature selection?

Answer:
Yes, by encoding features as genes and optimizing subset quality

❓ How do you handle constraints in Genetic Algorithms?

Answer:
Using penalty functions or repair mechanisms to enforce feasibility

❓ What is the difference between steady-state and generational GA?

Answer:
Steady-state replaces only a few individuals per generation; generational replaces the entire population

#️⃣ #genetic_algorithms #optimization #machine_learning #ai #evolutionary_computing #coding #python #dev

By: t.iss.one/DataScienceQ πŸš€
❔ Interview question

What is the output of the following code?
def outer():
x = 10
def inner():
nonlocal x
x += 5
return x
return inner()

result = outer()
print(result)

Answer:
15

tags: #python #advanced #coding #programming #interview #nonlocal #function #dev

By: t.iss.one/DataScienceQ πŸš€
⁉️ Interview question

What is the output of the following code?
import copy

a = [1, 2, [3, 4]]
b = copy.deepcopy(a)
b[2][0] = 'X'
print(a[2][0])

Answer:
3

#⃣ tags: #python #advanced #coding #programming #interview #deepcopy #mutable #dev

By: t.iss.one/DataScienceQ πŸš€
⁉️ Interview question

What is the output of the following code?
def func(a, b=[]):
b.append(a)
return b

print(func(1))
print(func(2))

Answer:
[1, 2]

#⃣ tags: #python #advanced #coding #programming #interview #defaultarguments #mutable #dev

By: t.iss.one/DataScienceQ πŸš€
⁉️ Interview question

What is the output of the following code?
class A:
def __init__(self):
self.x = 1

def __str__(self):
return str(self.x)

a = A()
print(a)


Answer:
1

#️⃣ tags: #python #advanced #coding #programming #interview #strmethod #object #dev

By: t.iss.one/DataScienceQ πŸš€
⁉️ Interview question
What happens when you use __enter__ and __exit__ methods in a context manager that opens a file with mode 'r+' but the file is simultaneously being written to by another process using os.fsync()? How does Python’s internal buffering interact with system-level synchronization mechanisms, and what potential race conditions could arise if the file is not properly closed?

When the file is opened in `'r+'` mode, Python's buffered I/O interacts with the OS's `fsync()` call, which forces data to be written to disk immediately. However, if another process calls `fsync()` while the Python context manager is still active, the buffer might contain stale or partially written data, leading to inconsistent reads. The `__exit__` method may flush the buffer before closing, but if the external process has already synced, the file content can become corrupted due to overlapping write operations. This scenario highlights the importance of using atomic operations or file locks (e.g., `fcntl`) when sharing files across processes.

#️⃣ tags: #Python #AdvancedPython #FileHandling #ContextManager #Multithreading #RaceCondition #OSInteraction #Buffering #Synchronization #ProgrammingInterview

By: t.iss.one/DataScienceQ πŸš€
Please open Telegram to view this post
VIEW IN TELEGRAM
⁉️ Interview question
How does Python’s mmap module behave when mapping a file that is concurrently being truncated by another process using os.ftruncate()? What are the implications for memory safety, and under what conditions might this lead to segmentation faults or undefined behavior?

When a file is mapped via `mmap` and simultaneously truncated by another process, the virtual memory pages remain valid until accessed. However, if the mapped region refers to data beyond the new file size, accessing those pages results in undefined behavior, potentially causing segmentation faults. The operating system may not immediately invalidate the mappings, leading to crashes or data corruption. This scenario highlights the need for synchronization mechanisms like file locks or signals to ensure safe concurrent access

#️⃣ tags: #Python #AdvancedPython #FileHandling #MemoryMapping #mmap #ConcurrentProgramming #OS #SystemCalls #UndefinedBehavior #SegmentationFault #FileLocking

By: t.iss.one/DataScienceQ πŸš€
Please open Telegram to view this post
VIEW IN TELEGRAM