Thursday, June 21, 2018

R -Programming on Data distribution

    Today we will learn data distribution. Data distribution is very important for the data analysis as well as the data analytics.
    When we are dealing with the big data we have to think about the data distribution. There are different types of distribution like
    Binomial distribution
    Possion distribution
    Normal distribution
    Uniform distribution
    Beta distribution
    Gamma distribution
    And so on...
    Another thing is that we *can customise our distribution*
    Here i am creating one vector that is sohini
    So, *sohini=(1:100)*
    It means sohini vector includes data from the range 1 to 100.
    1 is the lower bound that is minimum and 100 is the upper bound that is maximum.Let's calculate the mean and the sd of the sohini vector.
    Since the data distribution in sohini is uniform we find the box whisker pot similar to the plot of uniform distribution.Again in the density plot we will find the uniform distribution here the command is *plot(sohini)*
    We can use *summary(sohini)* we will get the result minimum, 1st quartile, mean, median, 3rd quartile and the maximum.
    Unif is the command for uniform distribution. So the command is
    rai=10
    akshita=50
    sohini =runif(sohini, rai, akshita)
    Be careful of capital and small letter. Some functions like mean,sd,summary. Here we are using lower case. But there are some commands like NROW,NCOL, there we should use capital.
    Now you will find two different types of plots earlier you have received plot from one to hundred but here, you are getting the plot that is also the uniform distribution plot where, minimum is ten and maximum is fifty, so when you are writing *sohini=runif(sohini)*
    So sohini means 100, min that is rai and akshita the maximum
    So this is the range.
    Comments
    18.6.18
    Akshita Bindal wrote on R programming
    ...See more
    LikeShow More Reactions
    Comment
    Comments

No comments:

Post a Comment