Posts

Showing posts from March, 2021

Data Visualization with R Ch 2

Image
 Abinezer Abebe STEAM  Data Preparation CH 2 Introduction to ggplot2 in Chapter 2. We worked on ggplot2 packages. The first part in building a ggplot2 graph is creating a ggplot2 function. In building the graph there are two major part’s data and variables mapped to show the graph. The variables are strategically placed within the function. The next step is to add Geoms to our scatter plot graph. Geoms are objects that can be graphed using the Geom_ function. In our scatter plot graph there is an outlier that is larger than the rest of the wages in the graph showing the relationship between wages and experience. I took the outlier in data out and redesigned the scatter plot graph. Next I played around with parameters that controlled the color transparency and shape to redesign the graph. After I grouped the variables in two categories of male and female. The color that I chose for male was blue and female I chose pink. The variable colors fall under the ‘aes” function because we are me

Data Visualization with R Ch1

Image
Abinezer Abebe STEAM  Data Preparation  This semester we will be working on R. R is an integrated suite of software facilities for data manipulation calculation and graphical display. R was inherently made for statistical computing and graphics of data and very similar to the S language. Which means code written in S can run syntax free in R. R provides a wide range of statistical linear and nonlinear modeling. Often used as the choice in statically methodology. It was very simple and easy components such as mathematical symbols and formulae. R is a great design choice for a user to retain control. R runs on wide range of operating systems like Linux platforms FreeBSD Windows MacOS. This week I worked on getting my R IDE configurated and ready to take in data. The R environment is very effective at data handling and storage facility. Operators for calculations on an array or matrices is possible in R. R has coherent integrated collection of intermediate tools for data analysis. R also