bree7246
  • 19-08-2021
  • Computers and Technology
answered

In the Gradient Descent algorithm, we are more likely to reach the global minimum if the learning rate is set to a large value.

a. True
b. False

Answer:

oddkevin9
  • 19-08-2021
False.
swan2414
  • 23-08-2021

Answer:

False, I think.

Explanation:

On a non-convex function, gradient descent generally converges to a local minimum, not necessarily the global one. Which minimum it reaches depends on the starting point: different starting points can lead to different local minima (roughly, the bottom of whichever slope the starting point sits on). A large learning rate does not fix this. If alpha (the learning rate) is too large, each step can overshoot the minimum, so gradient descent may fail to converge and may even diverge.
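
Both points can be checked with a few lines of Python. This is only a minimal sketch, assuming plain gradient descent with a fixed learning rate. The helper gradient_descent and the two test functions are illustrative choices, not something given in the question.

```python
def gradient_descent(grad, x0, alpha, steps=50):
    """Plain gradient descent with a fixed learning rate: x <- x - alpha * grad(x)."""
    x = x0
    for _ in range(steps):
        x = x - alpha * grad(x)
    return x


# Point 1: a learning rate that is too large makes the iterates diverge.
# Illustrative function f(x) = x**2, with a single minimum at x = 0.
def grad_quadratic(x):
    return 2 * x


# alpha = 0.1 multiplies x by (1 - 0.2) = 0.8 each step, so x shrinks toward 0.
small_alpha_result = gradient_descent(grad_quadratic, x0=1.0, alpha=0.1)
# alpha = 1.5 multiplies x by (1 - 3.0) = -2 each step, so |x| doubles every step.
large_alpha_result = gradient_descent(grad_quadratic, x0=1.0, alpha=1.5)


# Point 2: different starting points can end in different minima.
# Illustrative function f(x) = x**4 - 2*x**2 + 0.5*x, which has two minima:
# a deeper one on the left (global) and a shallower one on the right (local).
def f_double_well(x):
    return x**4 - 2 * x**2 + 0.5 * x


def grad_double_well(x):
    return 4 * x**3 - 4 * x + 0.5


from_left = gradient_descent(grad_double_well, x0=-0.5, alpha=0.05, steps=200)
from_right = gradient_descent(grad_double_well, x0=0.5, alpha=0.05, steps=200)

print("small alpha:", small_alpha_result)
print("large alpha:", large_alpha_result)
print("start at -0.5 ends at", from_left, "with f =", f_double_well(from_left))
print("start at +0.5 ends at", from_right, "with f =", f_double_well(from_right))
```

With the small learning rate the quadratic example settles near 0, while with the large one the value blows up. In the second example, the run started at +0.5 stays in the shallower right-hand minimum even though a lower minimum exists on the left, so it never reaches the global minimum.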
