Which of the following is not a keyword?

What percentage of numbers from 1 to 70 have 1 or 9 in the unit's digit?

What are the benefits of mini-batch gradient descent?

What do you mean by a hard margin?

In a 300 m race A beats B by 22.5 m or 6 seconds. B's time over the course is:

In a 200 metres race A beats B by 35 m or 7 seconds. A's time over the course is:

If you increase the number of hidden layers in a Multi Layer Perceptron, the classification error of test data always decreases.

Which of the following statement is FALSE about ADAGRAD?

Which gradient technique is more advantageous when the data is too big to handle in RAM simultaneously?

Which of the following offsets, do we use in linear regressions least square line fit? Suppose horizontal axis is independent variable and vertical axis is dependent variable.

Which of the following is an invalid statement?

What is the output of this expression, 3*1**3?

Which is the correct operator for power(xy)?

In a neural network, which of the following techniques is used to deal with overfitting?

A, B, C rent a pasture. A puts 10 oxen for 7 months, B puts 12 oxen for 5 months and C puts 15 oxen for 3 months for grazing. If the rent of the pasture is Rs. 175, how much must C pay as his share of rent?

Lets say, you are using activation function X in hidden layers of neural network. At a particular neuron for any given input, you get the output as -0.0001 Which of the following activation function could X represent?

Suppose you have inputs as x, y, and z with values -2, 5, and -4 respectively. You have a neuron ‘q’ and neuron ‘f’ with functions: q = x + y , f = q * z

**What is the gradient of F with respect to x, y, and z?**

**(HINT: To calculate the gradient, you must find (df/dx), (df/dy) and (df/dz))**

For a classification task, instead of random weight initializations in a neural network, we set all the weights to zero. Which of the following statements is true?

What are Hyperparameters are set before training?

Which of the following are real world applications of the SVM?

What is a dead unit in a neural network?

What will be the output of print(print())

The neural network consists of many neurons, each neuron takes an input, processes it and gives an output. Here’s a diagrammatic representation of a real neuron.

** Which of the following statement(s) correctly represents a real neuron?**

In training a neural network, you notice that the loss does not decrease in the few starting epochs.

**The reasons for this could be:**

**The learning rate is low** **The regularization parameter is high** **Stuck at local minima**

**What according to you are the probable reasons?**

Batch Normalization is helpful because

How would you import a decision tree classifier in sklearn?

Three numbers which are co-prime to each other are such that the product of the first two is 551 and that of the last two is 1073. The sum of the three numbers is:

What is the sequence of the following tasks in a perceptron?

**Initialize weights of perceptron randomly** **Go to the next batch of the dataset** **If the prediction does not match the output, change the weights** **For a sample input, compute an output**

Which of the following is not a complex number?

What is the output of type(45/3)

The cost price of 20 articles is the same as the selling price of x articles. If the profit is 25%, then the value of x is:

A and B can together finish a work 30 days. They worked together for 20 days and then B left. After another 20 days, A finished the remaining work. In how many days A alone can finish the work?

Consider a function which is defined below:

def fun(x): x[0] = 5 return x

Now you define a list which has three numbers in it. g = [10,11,12] Which of the following will be the output of the given print statement: print fun(g), g

The least number which when divided by 5, 6 , 7 and 8 leaves a remainder 3, but when divided by 9 leaves no remainder, is:

What will be the output of the following Python code snippet if x=1? x<<2

Different learning methods does not include

What will be the output of the following Python code? 1.>>>str="hello" 2.>>>str[:-3] 3.>>>

What would be the best value for random_state (Seed value ?

What do you mean by generalization error in terms of the SVM?

Which of the following is incorrect?

How much did Rohit get as profit at the year-end in the business done by Nitin, Rohit and Kunal?

I. Kunal invested Rs. 8000 for nine months, his profit was times that of Rohit's, and his investment was four times that of Nitin.

II. Nitin and Rohit invested for one year in the proportion 1: 2 respectively.

III. The three together got Rs. 1000 as profit at the year-end.

Which of the following methods do we use to find the best fit line for data in Linear Regression?

Which of the following is an invalid variable?

K-fold cross-validation is

Which one of the following has the same precedence level?

What error occurs when you execute the following Python code snippet? apple = mango

If a = 0.1039, then the value of 4a2 - 4a + 1 + 3a is:

What are the steps for using a gradient descent algorithm?

**Calculate the error between the actual value and the predicted value** **Reiterate until you find the best weights of the network** **Pass an input through the network and get values from the output layer** **Initialize random weight and bias** **Go to each neuron that contributes to the error and change its respective values to reduce the error**

Which of the following cannot be a variable?

Given a function that does not return any value, What value is thrown by default when executed in shell.

Which of the given point is true about auto encoder?

Which of the following sentence is FALSE regarding regression?

How to select best hyperparameters in tree based models?

Import time str = '21/01/2017' datetime_value = time.strptime(str,date_format)

To convert the above string, what should be written in place of date_format?

If I am using all features of my dataset and I achieve 100% accuracy on my training set, but ~70% on validation set, what should I look out for?

Which of the following is incorrect?

Adding more basis functions in a linear model... (pick the most probably option)

4 mat-weavers can weave 4 mats in 4 days. At the same rate, how many mats would be woven by 8 mat-weavers in 8 days?

Which of the following is invalid?

Which of the following is an example of a deterministic algorithm?

Type of abc is ? if abc=100,000,10

What will be the output of the following Python expression?

Three times the first of three consecutive odd integers is 3 more than twice the third. The third integer is:

Now let’s revise the previous slides. We have learned that:

**A neural network is a (crude) mathematical representation of a brain, which consists of smaller components called neurons.** **Each neuron has an input, a processing function, and an output.** **These neurons are stacked together to form a network, which can be used to approximate any function.** **To get the best possible neural network, we can use techniques like gradient descent to update our neural network model.**

**Given above is a description of a neural network. When does a neural network model become a deep learning model?**

Which statement is true about NAG?

Two dice are thrown simultaneously. What is the probability of getting two numbers whose product is even?

The expression Int(x) implies that the variable x is converted to integer.

What if we use a learning rate that’s too large?

What are the factors to select the depth of neural network?

- Type of neural network (eg. MLP, CNN etc)
- Input data
- Computation power, i.e. Hardware capabilities and software capabilities
- Learning Rate
- The output function to map

Which of the following is not an example of a time series model?

Two, trains, one from Howrah to Patna and the other from Patna to Howrah, start simultaneously. After they meet, the trains reach their destinations after 9 hours and 16 hours respectively. The ratio of their speeds is:

When pooling layer is added in a convolutional neural network, translation in-variance is preserved.

A is 30% more efficient than B. How much time will they, working together, take to complete a job which A alone could have done in 23 days?

Which of the following will run without errors?

The number of neurons in the output layer should match the number of classes (Where the number of classes is greater than 2) in a supervised learning task.

Which of the following cross validation techniques is better suited for time series data?

The effectiveness of an SVM depends upon:

Two stations A and B are 110 km apart on a straight line. One train starts from A at 7 a.m. and travels towards B at 20 kmph. Another train starts from B at 8 a.m. and travels towards A at a speed of 25 kmph. At what time will they meet?

“Convolutional Neural Networks can perform various types of transformation (rotations or scaling) in an input”.

Is the statement correct True or False?

Which of the following will be Euclidean Distance between the two data point A(1,3) and B(2,3)?

Which of the following distance metric can not be used in k-NN?

In random forest or gradient boosting algorithms, features can be of any type. For example, it can be a continuous feature or a categorical feature. Which of the following option is true when you consider these types of features?

Which of the following algorithm are not an example of ensemble learning algorithm?

What is the volume of 32 metre high cylindrical tank?

1. The area of its base is 154 m2.

2. The diameter of the base is 14 m.

A can run 22.5 m while B runs 25 m. In a kilometre race B beats A by:

A, B and C enter into a partnership in the ratio 7/2:4/3:6/5 After 4 months, A increases his share 50%. If the total profit at the end of one year be Rs. 21,600, then B's share in the profit is:

Which one of the following has the highest precedence in the expression?

In which neural net architecture, does weight sharing occur?

For an image recognition problem (recognizing a cat in a photo), which architecture of neural network would be better suited to solve the problem?

Which of the following option is true about k-NN algorithm?

In a regular week, there are 5 working days and for each day, the working hours are 8. A man gets Rs. 2.40 per hour for regular work and Rs. 3.20 per hours for overtime. If he earns Rs. 432 in 4 weeks, then how many hours does he work for ?

You are building a neural network where it gets input from the previous layer as well as from itself.

**Which of the following architecture has feedback connections?**