From Dense Layers to Convolutions

https://d2l.ai/chapter_convolutional-neural-networks/why-conv.html

Can someone shed light on:
This means that the input into a network has 1 million dimensions. Even an aggressive reduction to 1,000 dimensions after the first layer means that we need 10^9 parameters.

How?
I see the input vector is 10^3, and the output vector is of course two. I don't know for what sequential architecture we would need 10^9 parameters.

According to the text, the input vector is 1M (pixels) and the first (fully connected) layer reduces the number of dimensions to 1K (i.e. the layer has 1,000 neurons). Since every one of the 10^3 neurons connects to every one of the 10^6 inputs, that amounts to 10^6 x 10^3 = 10^9 parameters.
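To make the arithmetic concrete, here is a small sketch of the parameter count for such a hypothetical fully connected layer (the layer sizes are the ones from the text; no actual network is built):

```python
# A hypothetical fully connected layer mapping a 1-megapixel image
# (flattened to 10^6 inputs) down to 10^3 hidden units.
in_features = 10**6   # one input per pixel
out_features = 10**3  # "aggressive reduction to 1,000 dimensions"

# Each output neuron has one weight per input, plus one bias.
weights = in_features * out_features
biases = out_features
print(weights)           # 10^9 weights
print(weights + biases)  # biases add a negligible 10^3 on top
```

The 10^9 comes entirely from the weight matrix; the biases contribute only one extra parameter per neuron.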

just fyi,
“But of course, if those biases do not agree with reality, e.g. if images turned out not to be translation invariant,”
This sentence seems to be incomplete.

The equations in 6.1.2 do not express a bias term explicitly, while the Conv2D implemented in 6.2.2 has one. I think it would be better to express the bias term explicitly in 6.1.2. For example, we could add b[i,j] to the summation in each equation in 6.1.2. By translation invariance, b[i,j] cannot depend on i or j, so it is a constant.

Please let me know if my understanding is not correct.
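If I understand correctly, the point can be illustrated with a minimal NumPy sketch modeled on the book's `corr2d` (the function name and test values here are my own, not from the text): the bias that survives translation invariance is a single scalar `b` added at every output location.

```python
import numpy as np

def corr2d_with_bias(X, K, b):
    """2D cross-correlation of input X with kernel K, plus a bias.
    Translation invariance forces b[i, j] to be independent of (i, j),
    so it collapses to one scalar constant added at every position."""
    h, w = K.shape
    Y = np.zeros((X.shape[0] - h + 1, X.shape[1] - w + 1))
    for i in range(Y.shape[0]):
        for j in range(Y.shape[1]):
            # Same kernel K and same bias b at every (i, j).
            Y[i, j] = (X[i:i + h, j:j + w] * K).sum() + b
    return Y

X = np.arange(9.0).reshape(3, 3)
K = np.ones((2, 2))
print(corr2d_with_bias(X, K, 0.5))
# [[ 8.5 12.5]
#  [20.5 24.5]]
```

This also matches frameworks in practice: a conv layer's bias has one value per output channel, not one per spatial position.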