- What is Python? What are the benefits of using Python
- What is a dynamically typed language?
- What is an Interpreted language?
- What is PEP 8 and why is it important?
- What is Scope in Python?
- What are lists and tuples? What is the key difference between the two?
- What are the common built-in data types in Python?
- What is pass in Python?
- What are modules and packages in Python?
- What are global, protected and private attributes in Python?
- What is the use of self in Python?
- What is init?
- What is break, continue and pass in Python?
- What are unit tests in Python?
- What is docstring in Python?
- What is slicing in Python?
- Explain how can you make a Python Script executable on Unix?
- What is the difference between Python Arrays and lists? Python Interview Questions for Experienced
- How is memory managed in Python?
- What are Python namespaces? Why are they used?
- What is Scope Resolution in Python?
- What are decorators in Python?
- What are Dict and List comprehensions?
- What is lambda in Python? Why is it used?
- How do you copy an object in Python?
- What is the difference between xrange and range in Python?
- What is pickling and unpickling?
- What are generators in Python?
- What is PYTHONPATH in Python?
- What is the use of help() and dir() functions?
- What is the difference between .py and .pyc files?
- How Python is interpreted?
- How are arguments passed by value or by reference in python?
- What are iterators in Python?
- Explain how to delete a file in Python?
- Explain split() and join() functions in Python?
- What does *args and **kwargs mean?
- What are negative indexes and why are they used?
- How do you create a class in Python?
- How does inheritance work in python? Explain it with an example.
- How do you access parent members in the child class?
- Are access specifiers used in python?
- Is it possible to call parent class without its instance creation?
- How is an empty class created in python?
- Differentiate between new and override modifiers.
- Why is finalize used?
- What is init method in python?
- How will you check if a class is a child of another class?
- What do you know about pandas?
- Define pandas dataframe.
- How will you combine different pandas dataframes?
- Can you create a series from the dictionary object in pandas?
- How will you identify and deal with missing values in a dataframe?
- What do you understand by reindexing in pandas?
- How to add new column to pandas dataframe?
- How will you delete indices, rows and columns from a dataframe?
- Can you get items of series A that are not available in another series B?
- How will you get the items that are not common to both the given series A and B?
- While importing data from different sources, can the pandas library recognize dates?
- What do you understand by NumPy?
- How are NumPy arrays advantageous over python lists?
- What are the steps to create 1D, 2D and 3D arrays?
- You are given a numpy array and a new column as inputs. How will you delete the second column and replace the column with a new column value?
- How will you efficiently load data from a text file?
- How will you read CSV data into an array in NumPy?
- How will you sort the array based on the Nth column?
- How will you find the nearest value in a given numpy array?
- How will you reverse the numpy array using one line of code?
- How will you find the shape of any given NumPy array? Python Libraries Interview Questions
- Differentiate between a package and a module in python.
- What are some of the most commonly used built-in modules in Python?
- What are lambda functions?
- How can you generate random numbers?
- Can you easily check if all characters in the given string is alphanumeric?
- What are the differences between pickling and unpickling?
- Define GIL.
- Define PYTHONPATH.
- Define PIP.
- Are there any tools for identifying bugs and performing static analysis in python?
- Differentiate between deep and shallow copies.
- What is main function in python? How do you invoke it?
- Write python function which takes a variable number of arguments.
- WAP (Write a program) which takes a sequence of numbers and check if all numbers are unique.
- Write a program for counting the number of every character of a given text file.
- Write a program to check and return the pairs of a given array A whose sum value is equal to a target value N.
- Write a Program to add two integers >0 without using the plus operator.
- Write a Program to solve the given equation assuming that a,b,c,m,n,o are constants:
- Write a Program to match a string that has the letter ‘a’ followed by 4 to 8 'b’s.
- Write a Program to convert date from yyyy-mm-dd format to dd-mm-yyyy format.
- Write a Program to combine two different dictionaries. While combining, if you find the same keys, you can add the values of these same keys. Output the new dictionary
- What is Data Science?
- What is the difference between data analytics and data science?
- What are some of the techniques used for sampling? What is the main advantage of sampling?
- List down the conditions for Overfitting and Underfitting.
- Differentiate between the long and wide format data.
- What are Eigenvectors and Eigenvalues?
- What does it mean when the p-values are high and low?
- When is resampling done?
- What do you understand by Imbalanced Data?
- Are there any differences between the expected value and mean value?
- What do you understand by Survivorship Bias?
- Define the terms KPI, lift, model fitting, robustness and DOE.
- Define confounding variables.
- Define and explain selection bias?
- Define bias-variance trade-off?
- Define the confusion matrix?
- What is logistic regression? State an example where you have recently used logistic regression.
- What is Linear Regression? What are some of the major drawbacks of the linear model?
- What is a random forest? Explain it’s working.
- In a time interval of 15-minutes, the probability that you may see a shooting star or a bunch of them is 0.2. What is the percentage chance of you seeing at least one star shooting from the sky if you are under it for about an hour?
- What is deep learning? What is the difference between deep learning and machine learning?
- What is a Gradient and Gradient Descent?
- How are the time series problems different from other regression problems?
- What are RMSE and MSE in a linear regression model?
- What are Support Vectors in SVM (Support Vector Machine)?
- So, you have done some projects in machine learning and data science and we see you are a bit experienced in the field. Let’s say your laptop’s RAM is only 4GB and you want to train your model on 10GB data set.
- Explain Neural Network Fundamentals.
- What is Generative Adversarial Network?
- What is a computational graph?
- What are auto-encoders?
- What are Exploding Gradients and Vanishing Gradients?
- What is the p-value and what does it indicate in the Null Hypothesis?
- Since you have experience in the deep learning field, can you tell us why TensorFlow is the most preferred library in deep learning?
- Suppose there is a dataset having variables with missing values of more than 30%, how will you deal with such a dataset?
- What is Cross-Validation?
- What are the differences between correlation and covariance?
- How do you approach solving any data analytics based project?
- How regularly must we update an algorithm in the field of machine learning?
- Why do we need selection bias?
- Why is data cleaning crucial? How do you clean the data?
- What are the available feature selection methods for selecting the right variables for building efficient predictive models?
- During analysis, how do you treat the missing values?
- Will treating categorical variables as continuous variables result in a better predictive model?
- How will you treat missing values during data analysis?
- What does the ROC Curve represent and how to create it?
- What are the differences between univariate, bivariate and multivariate analysis?
- What is the difference between the Test set and validation set?
- What do you understand by a kernel trick?
- Differentiate between box plot and histogram.
- How will you balance/correct imbalanced data?
- What is better - random forest or multiple decision trees?
- Consider a case where you know the probability of finding at least one shooting star in a 15-minute interval is 30%. Evaluate the probability of finding at least one shooting star in a one-hour duration?
- Toss the selected coin 10 times from a jar of 1000 coins. Out of 1000 coins, 999 coins are fair and 1 coin is double-headed, assume that you see 10 heads. Estimate the probability of getting a head in the next coin toss.
- What are some examples when false positive has proven important than false negative?
- Give one example where both false positives and false negatives are important equally?
- Is it good to do dimensionality reduction before fitting a Support Vector Model?
- What are various assumptions used in linear regression? What would happen if they are violated?
- How is feature selection performed using the regularization method?
- How do you identify if a coin is biased?
- What is the importance of dimensionality reduction?
- How is the grid search parameter different from the random search tuning strategy?
- How do I prepare for a data science interview?
- Are data science interviews hard?
- What are the top 3 technical skills of a data scientist?
- Is data science a good career?
- Are coding questions asked in data science interviews?
- Is python and SQL enough for data science?
- What are Data Science tools?
- Why was Machine Learning Introduced?
- What are Different Types of Machine Learning algorithms?
- What is Supervised Learning?
- What is Unsupervised Learning?
- What is ‘Naive’ in a Naive Bayes?
- What is PCA? When do you use it?
- Explain SVM Algorithm in Detail
- What are Support Vectors in SVM?
- What are Different Kernels in SVM?
- What is Cross-Validation?
- What is Bias in Machine Learning?
- Explain the Difference Between Classification and Regression?
- What is F1 score? How would you use it?
- Define Precision and Recall?
- How to Tackle Overfitting and Underfitting?
- What is a Neural Network?
- What are Loss Function and Cost Functions? Explain the key Difference Between them?
- What is Ensemble learning?
- How do you make sure which Machine Learning Algorithm to use?
- How to Handle Outlier Values?
- What is a Random Forest? How does it work?
- What is Collaborative Filtering? And Content-Based Filtering?
- What is Clustering?
- How can you select K for K-means Clustering?
- What are Recommender Systems?
- How do check the Normality of a dataset?
- Can logistic regression use for more than 2 classes?
- Explain Correlation and Covariance?
- What is P-value?
- What are Parametric and Non-Parametric Models?
- What is Reinforcement Learning?
- Difference Between Sigmoid and Softmax functions?
- What is linear regression, and how does it work?
- What are the assumptions of a linear regression model?
- What are outliers? How do you detect and treat them? How do you deal with outliers in a linear regression model?
- How do you determine the best fit line for a linear regression model?
- What is the difference between simple and multiple linear regression?
- What is linear Regression Analysis?
- What is multicollinearity and how does it affect linear regression analysis?
- What is the difference between linear regression and logistic regression?
- What are the common types of errors in linear regression analysis?
- What is the difference between a dependent and independent variable in linear regression?
- What is an interaction term in linear regression and how is it used?
- What is the difference between biased and unbiased estimates in linear regression?
- How do you measure the strength of a linear relationship between two variables?
- What is the difference between a population regression line and a sample regression line?
- What is the difference between linear regression and non-linear regression?
- What are the common techniques used to improve the accuracy of a linear regression model?
- What is a residual in linear regression and how is it used in model evaluation?
- What is the difference between a parametric and non-parametric regression model?
- What are the assumptions of the ordinary least squares method for linear
- How do you determine the significance of a predictor variable in a linear regression model?
- What is the role of a dummy variable in linear regression analysis?
- What is heteroscedasticity?
- What is the difference between a categorical and continuous variable in linear regression?
- What is the impact of correlated predictor variables on linear regression analysis?
- How do you evaluate the goodness of fit of a linear regression model?
- What is the role of a regression coefficient in linear regression analysis?
- What is a prediction interval in linear regression and how is it used?
- How to find RMSE and MSE?
- How do you test for autocorrelation in a linear regression model?
- What are the common challenges faced when building a linear regression model?
- Can you explain the concept of collinearity and how it affects a linear regression model?
- How do you choose the right variables for a linear regression model?
- What is the role of regularization techniques in preventing overfitting in linear regression?
- Can you explain the concept of overfitting in linear regression?
- What are the limitations of linear regression?
- What are the possible ways of improving the accuracy of a linear regression model?
- Can you explain the concept of bias-variance tradeoff in linear regression?
- Can you explain the difference between a linear regression model that assumes homoscedasticity and one that assumes heteroscedasticity?
- What is the difference between a linear regression model with a linear relationship and one with a non-linear relationshi
- What is the curse of dimensionality? Can you give an example?
- What is the difference between correlation and regression?
- What is the main problem with using a single regression line?
- What does locally weighted regression results depend on?
- Which of the following is the simplest error detection method?
- If you have only one independent variable, how many coefficients will you require to estimate in a simple linear regression model?
- What is the performance of the model aer adding a non important feature to a linear regression model?
- Linearity in regression corresponds to what ?
- Which of the following plots is best suited to test the linear relationship of independent and dependent continuous variables?
- What is the primary difference between R squared and adjusted R squared?
- What is the importance of the F-test in a linear model?
- Explain the Gradient Descent algorithm with respect to linear regression?
- For a linear regression model, how do we interpret a Q-Q plot?
- What are MAE and MAPE?