Get Even More Visitors To Your Blog, Upgrade To A Business Listing >>

SAS Interview Questions And Answers Part – 4

Question 61: What is the difference between '+' operator and SUM function?

Answer:

https://www.synergisticit.com/wp-content/uploads/2023/09/Question_61_What_is_the_diffe.mp3

The ‘+’ operator returns a missing value if any arguments are missing, while the SUM function returns the sum of all non-missing arguments.

Question 62: How can you convert a numeric variable to a character variable?

Answer:

https://www.synergisticit.com/wp-content/uploads/2023/09/Question_62_How_can_you_conve.mp3

You can utilize the put() function within SAS to transform a numeric variable into a character variable.

Question 3: Explain the term Q-Learning?

Answer:

https://www.synergisticit.com/wp-content/uploads/2023/03/3_Question_3_Explain_the_term_.mp3

Q-learning is a popular algorithm used in reinforcement learning. It is based on the Bellman equation. In this algorithm, the agent tries to learn the policies that can provide the best actions to perform for maximizing the rewards under particular circumstances. The agent learns these optimal policies from past experiences.

Question 4: What is Deep Learning, and how is it used in real-world?

Answer:

https://www.synergisticit.com/wp-content/uploads/2023/03/4_Question_4_What_is_Deep_Learn.mp3

Deep learning is a subset of Machine Learning that mimics the working of the human brain. It is inspired by the human brain cells, called neurons, and works on the concept of neural networks to solve complex real-world problems. It is also known as the deep neural network or deep neural learning.

Some real-world applications of deep learning are:

  1.  Adding different colors to the black&white images
  2. Computer vision
  3. Text generation
  4. Deep-Learning Robots, etc.

Question 5: How is machine learning related to AI?

Answer:

https://www.synergisticit.com/wp-content/uploads/2023/03/5_Question_5_How_is_machine_lea.mp3

Machine learning is a subset or subfield of Artificial intelligence. It is a way of achieving AI. As both are the two different concepts and the relation between both can be understood as “AI uses different Machine learning algorithms and concepts to solve the complex problems”.

Question 6: What is Markov's Decision process?

Answer:

https://www.synergisticit.com/wp-content/uploads/2023/03/6_Question_6_What_is_Markovs_D.mp3

The solution for a reinforcement learning problem can be achieved using the Markov decision process or MDP. Hence, MDP is used to formalize the RL problem. It can be said as the mathematical approach to solve a reinforcement learning problem. The main aim of this process is to gain maximum positive rewards by choosing the optimum policy.

Question 7: What are parametric and non-parametric model?

Answer:

https://www.synergisticit.com/wp-content/uploads/2023/03/7_Question_7_What_are_parametri.mp3

In machine learning, there are mainly two types of models, Parametric and Non-parametric. The explanation of these models is as follow:

Parametric Model: The parametric models use a fixed number of the parameters to create the ML model. It considers strong assumptions about the data. The examples of the parametric models are Linear regression, Logistic Regression, Naïve Bayes, Perceptron, etc.

 Non-Parametric Model: The non-parametric model uses flexible numbers of parameters. It considers a few assumptions about the data. These models are good for higher data and no prior knowledge. The examples of the non-parametric models are Decision Tree, K-Nearest Neighbour, SVM with Gaussian kernels, etc.

Question 8: What do you understand by the hyperparameter?

Answer:

https://www.synergisticit.com/wp-content/uploads/2023/03/8_Question_8_What_do_you_unders.mp3

In machine learning, hyperparameter is the parameters that determine and control the complete training process. The examples of these parameters are Learning rate, Hidden Layers, Hidden units, Activation functions, etc. These parameters are external from the model. The selection of good hyperparameters makes a better algorithm.

Question 9: What is overfitting? How can it be overcome in Machine Learning?

Answer:
https://www.synergisticit.com/wp-content/uploads/2023/03/9.Question_9_What_is_Back_Propa.m4a

When the machine learning algorithm tries to capture all the data points, and hence, as a result, captures noise also, then overfitting occurs in the model. Due to this overfitting issue, the algorithm shows the low bias, but the high variance in the output. Overfitting is one of the main issues in machine learning.

Methods to avoid Overfitting in ML:

  1. Cross-Validation
  2. Training With more data
  3. Regularization
  4. Ensembling
  5. Removing Unnecessary Features
  6. Early Stopping the training.

Question 10: Explain different ways to evaluate the performance of the Machine Learning model?

Answer:

https://www.synergisticit.com/wp-content/uploads/2023/03/10_Question_10_Explain_different.mp3

Some popular ways to evaluate the performance of the ML model are:

  1. Confusion Matrix: It is N*N table with different sets of value that is used to determine the performance of the classification model in machine learning.
  2. F1 score: It is the harmonic mean of precision and recall, which is used as one of the best metrics to evaluate the ML model. 
  3. Gain and lift charts: Gain & Lift charts are used to determine the rank ordering of the probabilities. 
  4. AUC-ROC curve: The AUC-ROC is another performance metric. The ROC is the plot between the sensitivity.
  5. Gini Coefficient: It is used in the classification problems, also known as the Gini Index. It determines the inequality between the values of variables. The high value of the Gini represents a good model.
  6. Root mean squared error: It is one of the most popular metrics used for the evaluation of the regression model. It works by assuming that errors are unbiased and have a normal distribution.
  7. Cross-Validation: It is another popular technique for evaluating the performance of the machine learning model. In this, the models are trained on subsets of the input data and evaluated on the complementary subset of the data.

Question 11: What Are the Three Stages of Building a Model in Machine Learning?

Answer:

https://www.synergisticit.com/wp-content/uploads/2023/03/11_Question_11_What_Are_the_Thre.mp3

The three stages of building a machine learning model are:

  1. Model Building: Choose a suitable algorithm for the model and train it according to the requirement.
  2. Model Testing: Check the accuracy of the model through the test data.
  3. Applying the Model: Make the required changes after testing and use the final model for real-time projects.

Here, it’s important to remember that once in a while, the model needs to be checked to make sure it’s working correctly. It should be modified to make sure that it is up-to-date.

Question 12: Compare K-means and KNN Algorithms?

Answer:

https://www.synergisticit.com/wp-content/uploads/2023/03/12_Question_12_Compare_Kmeans_a.mp3

Following are the comparison between both:

  1. K-NN is a Supervisedmachine learning while K-means is an unsupervised machine learning.
  2. K-NN is a classificationor regression machine learning algorithm whereas, K-means is a clustering machine learning algorithm.
  3. K-NN is a lazy learner.On the other hand, K-Means is an eager learner. An eager learner has a model fitting that means a training step but a lazy learner does not have a training phase.
  4. K-NN performs much better if all of the data have the same scale but this is not true for K-means.

Question 13: How Will You Know Which Machine Learning Algorithm to Choose for Your Classification Problem?

Answer:

https://www.synergisticit.com/wp-content/uploads/2023/03/13_Question_13_How_Will_You_Know.mp3

While there is no fixed rule to choose an algorithm for a classification problem, you can follow these guidelines:

  1. If accuracy is a concern, test different algorithms and cross-validate them.
  2. If the training dataset is small, use models that have low variance and high bias.
  3. If the training dataset is large, use models that have high variance and low bias.

Question 14: When Will You Use Classification over Regression?

Answer:

https://www.synergisticit.com/wp-content/uploads/2023/03/14_Question_14_When_Will_You_Use.mp3

Classification is used when your target is categorical, while regression is used when your target variable is continuous. Both classification and regression belong to the category of supervised machine learning algorithms.

Question 15: What is Kernel SVM?

Answer:
https://www.synergisticit.com/wp-content/uploads/2023/03/15_Question_15_What_is_Kernel_SV.mp3

Kernel SVM is the abbreviated version of the kernel support vector machine. Kernel methods are a class of algorithms for pattern analysis, and the most common one is the kernel SVM.

Question 16: What is the generalized linear model?

Answer:

https://www.synergisticit.com/wp-content/uploads/2023/03/16_Question_16_What_is_the_gener.mp3

The generalized linear model is the derivative of the ordinary linear regression model. GLM is more flexible in terms of residuals and can be used where linear regression does not seem appropriate. GLM allows the distribution of residuals to be other than a normal distribution. It generalizes the linear regression by allowing the linear model to link to the target variable using the linking function. Model estimation is done using the method of maximum likelihood estimation.

Question 17: What is Variance Inflation Factor?

Answer:

https://www.synergisticit.com/wp-content/uploads/2023/03/17_Question_17_What_is_Variance_.mp3

Variance Inflation Factor (VIF) is an estimate of the volume of multicollinearity in the collection of regression variables.

It is given as;

VIF = Variance of model / Variance of the model with a single independent variable.

Question 18: What is the difference between type I and type II error?

Answer:

https://www.synergisticit.com/wp-content/uploads/2023/03/18_Question_18_What_is_the_diffe.mp3

 Type I error is a false positive. Type I error is claiming something has happened when it hasn’t.

Type II error is a false negative error. Type II error is claiming nothing when in fact something has happened.

Question 19: Give a brief overview of sentiment analysis using machine learning?

Answer:

https://www.synergisticit.com/wp-content/uploads/2023/03/19_Question_19_Give_a_brief_over.mp3

 Sentiment analysis involves the use of natural learning processing to categories opinions expressed usually in a piece of text, in order to determine whether the writer’s attitude is positive, negative or neutral. Machine learning algorithms can be used to help ease the learning process of the bot. These learning algorithms are constantly fed with a massive amount of data so that it can adjust itself and continually improve.

Question 20: What is the difference between Data Mining and Machine Learning?

Answer:

https://www.synergisticit.com/wp-content/uploads/2023/03/20_Question_20_What_is_the_diffe.mp3

Data mining can be described as the process in which the structured data tries to abstract knowledge or interesting unknown patterns. During this process, machine learning algorithms are used. Whereas, Machine learning represents the study, design, and development of the algorithms which provide the ability to the processors to learn without being explicitly programmed.

The post SAS Interview Questions And Answers Part – 4 appeared first on SynergisticIT.



This post first appeared on Student Loan Crisis In The United States Solution, please read the originial post: here

Share the post

SAS Interview Questions And Answers Part – 4

×

Subscribe to Student Loan Crisis In The United States Solution

Get updates delivered right to your inbox!

Thank you for your subscription

×