Skip to main content

How to conduct Hypothesis Testing step by step - simple and elegant (part 3)

Step by Step procedure in Conducting the Hypothesis Testing: prerequisites:   Part 1:  What is Hypothesis Testing and When it is Used? Part 2:  How to decide Null and Alternate Hypothesis?                    https://www.isixsigma.com/   Before diving into the steps, first let’s understand the Important Terminology we need to understand: Null Hypothesis: It is a well-established fact which is believed by all the people. It can also defined as an Assumption that treats everything equally and similarly. Alternate Hypothesis: It is a new claim made against the Null Hypothesis. If we have enough evidence against the Null Hypothesis we will reject Null Hypothesis. P-value: Probability of Null Hypothesis being true. Significance level: probability of rejecting the Null Hypothesis when it is true. It is a critical point where we decide whether the Null Hypothesis is rejected or not. Generally, the significant level will be 0.05 which means a 5 percent risk while concluding

Simple Understanding of ALEXNET and its Architecture (part 1)

Alexnet is a state of art convolution neural network, because it has outperformed all the algorithms in terms of accuracy in ImageNet competition around 2012.

ImageNet competition is a challenge where a dataset is provided with 1000 categories with millions of images to be trained for image recognition.

 

ALEXNET ARCHITECTURE:

 

                                                                 Source: https://i0.wp.com/ramok.tech/wp-content/uploads/2017/12/2017-12-31_01h31_40.jpg

 

In the whole architecture, There are only convolution and max pooling operations followed by fully connected layers (yellow color) at the end, for classification.

For simple and fast understanding of the architecture you need to remember a formula:

[(n-k+2p)/s]+1

n -> size of image

k -> kernel size

p -> padding

s -> stride

·     Now if we observe the architecture, we are passing 224x224x3 image as an input to the first layer. 

   Lets simply understand what happening at first layer, so that you can easily analyze for other layers: 

·    In the first layer it is showing convolution operation with kernel size (11x11) and with stride (s=4).

Note 1: You need to remember that 96 such kernels are using for convolution operations. So, they represented it as 11x11x96.

Note 2: If padding is 'same', it means there is a padding operation otherwise there is no padding.

Now, Lets take our formula,

[(n-k+2p)/s]+1

Substitute n=224, k=11, s=4, p=0.

[(224-11)/4] +1 = [213/4] +1 = [53.15] +1 = 54+1 = 55

So the output we get is 55x55, since they applied 96 kernels, the resultant output we get is size of 55x55x96.

Similarly, The above math is applicable for all the layers both convolution and max pooling, Please try it on your own.

 

è In our figure, At the end of the blue color region the result should flattened and fed it into hidden layers.

è Till blue color, It is extracting image features and after that it is doing image classification.

 

Reasons Alexnet has outperformed all the algorithms till 2012:

1.  In Alexnet, They have used advanced concepts like Data Augmentation, Dropout layer, Relu Activation unit, Local Response Normalization (Normalizing all channels corresponding to a pixel instead of normalizing the whole tensor).

2.     The whole architecture has built on GPU’s.

 

Disadvantages:

1.  At every layer, They have used different kernel sizes like 11x11,5x5,3x3 with different padding and strides.

2.    Difficult to remember the architecture.


Following blogs might be useful for you.

VGG16:                     VGG16 ARCHITECTURE
INCEPTION_V3:     INCEPTION V3
RESNET:                   RESNET ARCHITECTURE


If you have any queries, please comment below...
For more content please follow the blogpost...

Comments

Popular posts from this blog

How to conduct Hypothesis Testing step by step - simple and elegant (part 3)

Step by Step procedure in Conducting the Hypothesis Testing: prerequisites:   Part 1:  What is Hypothesis Testing and When it is Used? Part 2:  How to decide Null and Alternate Hypothesis?                    https://www.isixsigma.com/   Before diving into the steps, first let’s understand the Important Terminology we need to understand: Null Hypothesis: It is a well-established fact which is believed by all the people. It can also defined as an Assumption that treats everything equally and similarly. Alternate Hypothesis: It is a new claim made against the Null Hypothesis. If we have enough evidence against the Null Hypothesis we will reject Null Hypothesis. P-value: Probability of Null Hypothesis being true. Significance level: probability of rejecting the Null Hypothesis when it is true. It is a critical point where we decide whether the Null Hypothesis is rejected or not. Generally, the significant level will be 0.05 which means a 5 percent risk while concluding

Simple Understanding of INCEPTION_V3 and it's Architecture (part 3)

INCEPTION_V3 Prerequisites:  VGGNet OR VGG16:  VGG16 Architecture   By looking it's name, everybody think's that it is a complicated story just like the movie INCEPTION. But trust me, I will prove you that it is wrong by explaining in the most detailed way. Till now, If we take a layer in any neural network we only applied single operation like convolution or maxpooling and also with fixed kernel size for the whole layer.  But Now, The idea is, why can't we use all the operations in a single layer at a time. There comes INCEPTION_V3. Lets zoom a single layer in the inception_v3, source: It's a screenshot from AndrewNg class If you observe the above figure, convolution operation with kernel sizes 1x1,3x3,5x5 and the max-pool operation, all have applied at a time. Here comes a problem, COMPUTATION. only from single layer we are getting billions of computations. For example, lets do a simple mathematical calculation here, Note: To understand this you need to know how conv

How to decide Null Hypothesis & Alternate Hypothesis - Examples (part 2)

Lets Understand Null Hypothesis and Alternate Hypothesis. Prerequisites : part 1:  What is Hypothesis Testing and When it is Used?   Null Hypothesis (H 0 ) : It is a well Established Fact or An Assumption which treats everything is similar and equal. Alternate Hypothesis(H a ) : The claim you want to prove. Note: H 0 and H a are always opposite to each other.                                                                                                                               https://keydifferences.com/ Example 1: Statement: During 18 th century, People used to believe that Sun revolves around the earth. Later Copernicus came and told that it is not true. Since, At the time of 18 th century “Sun revolves around the earth” is a well established fact and is widely accepted belief, So It will become our null hypothesis. Copernicus claimed that “Earth revolves around the sun” which is opposite to null hypothesis. So it will become Alternate Hypothesis. H 0 :   Sun revolves aro