Ec140 - Mean and Expectation

class: center, middle, inverse, title-slide

# Ec140 - Mean and Expectation
### Fernando Hoces la Guardia
### 06/23/2022

---

# Housekeeping

- Updated Syllabus

- Unofficial Course Capture!

- What is the weirdest concept you remember from yesterday?

- Switch to finish yesterday's slides

---
# This Lecture

- Introduction to Data

- Mean and Expectation

- Variance and Standard Deviation

---
# What Defines a Data Set?

.font90[
- Data Set is the collection of any type information (of multiple *Datum*)

- In quantitative analysis we focus on *structured* data sets (unlike, for example, unstructured field notes).

- In econometrics the most commnon way to structure data is in tabular, or rectangular, form.

- A tabular data set is a collection of variables that with information for one or more entities.

- Entities can represent multiple individuals, one individual over time, firms, countries, etc.

- Variables are represented in columns, and observations are represented by rows. (for more on variables [The Effect, Ch3](https://theeffectbook.net/ch-DescribingVariables.html#descriptions-of-variables))

]

---
# Data
.pull-left[
.font60[
<div id="htmlwidget-2eda341b58d768862ed0" style="width:100%;height:auto;" class="datatables html-widget"></div>
<script type="application/json" data-for="htmlwidget-2eda341b58d768862ed0">{"x":{"filter":"none","vertical":false,"caption":"<caption>Sample of US workers (Current Population Survey, 1976)<\/caption>","fillContainer":false,"data":[["1","2","3","4","5","6","7","8","9","10","11","12","13","14","15","16","17","18","19","20","21","22","23","24","25","26","27","28","29","30","31","32","33","34","35","36","37","38","39","40","41","42","43","44","45","46","47","48","49","50","51","52","53","54","55","56","57","58","59","60","61","62","63","64","65","66","67","68","69","70","71","72","73","74","75","76","77","78","79","80","81","82","83","84","85","86","87","88","89","90","91","92","93","94","95","96","97","98","99","100","101","102","103","104","105","106","107","108","109","110","111","112","113","114","115","116","117","118","119","120","121","122","123","124","125","126","127","128","129","130","131","132","133","134","135","136","137","138","139","140","141","142","143","144","145","146","147","148","149","150","151","152","153","154","155","156","157","158","159","160","161","162","163","164","165","166","167","168","169","170","171","172","173","174","175","176","177","178","179","180","181","182","183","184","185","186","187","188","189","190","191","192","193","194","195","196","197","198","199","200","201","202","203","204","205","206","207","208","209","210","211","212","213","214","215","216","217","218","219","220","221","222","223","224","225","226","227","228","229","230","231","232","233","234","235","236","237","238","239","240","241","242","243","244","245","246","247","248","249","250","251","252","253","254","255","256","257","258","259","260","261","262","263","264","265","266","267","268","269","270","271","272","273","274","275","276","277","278","279","280","281","282","283","284","285","286","287","288","289","290","291","292","293","294","295","296","297","298","299","300","301","302","303","304","305","306","307","308","309","310","311","312","313","314","315","316","317","318","319","320","321","322","323","324","325","326","327","328","329","330","331","332","333","334","335","336","337","338","339","340","341","342","343","344","345","346","347","348","349","350","351","352","353","354","355","356","357","358","359","360","361","362","363","364","365","366","367","368","369","370","371","372","373","374","375","376","377","378","379","380","381","382","383","384","385","386","387","388","389","390","391","392","393","394","395","396","397","398","399","400","401","402","403","404","405","406","407","408","409","410","411","412","413","414","415","416","417","418","419","420","421","422","423","424","425","426","427","428","429","430","431","432","433","434","435","436","437","438","439","440","441","442","443","444","445","446","447","448","449","450","451","452","453","454","455","456","457","458","459","460","461","462","463","464","465","466","467","468","469","470","471","472","473","474","475","476","477","478","479","480","481","482","483","484","485","486","487","488","489","490","491","492","493","494","495","496","497","498","499","500","501","502","503","504","505","506","507","508","509","510","511","512","513","514","515","516","517","518","519","520","521","522","523","524","525","526"],[3.1,3.24,3,6,5.3,8.75,11.25,5,3.6,18.18,6.25,8.13,8.77,5.5,22.2,17.33,7.5,10.63,3.6,4.5,6.88,8.48,6.33,0.53,6,9.56,7.78,12.5,12.5,3.25,13,4.5,9.68,5,4.68,4.27,6.15,3.51,3,6.25,7.81,10,4.5,4,6.38,13.7,1.67,2.93,3.65,2.9,1.63,8.6,5,6,2.5,3.25,3.4,10,21.63,4.38,11.71,12.39,6.25,3.71,7.78,19.98,6.25,10,5.71,2,5.71,13.08,4.91,2.91,3.75,11.9,4,3.1,8.45,7.14,4.5,4.65,2.9,6.67,3.5,3.26,3.25,8,9.85,7.5,5.91,11.76,3,4.81,6.5,4,3.5,13.16,4.25,3.5,5.13,3.75,4.5,7.63,15,6.85,13.33,6.67,2.53,9.8,3.37,24.98,5.4,6.11,4.2,3.75,3.5,3.64,3.8,3,5,4.63,3,3.2,3.91,6.43,5.48,1.5,2.9,5,8.92,5,3.52,2.9,4.5,2.25,5,10,3.75,10,10.95,7.9,4.72,5.84,3.83,3.2,2,4.5,11.55,2.14,2.38,3.75,5.52,6.5,3.1,10,6.63,10,2.31,6.88,2.83,3.13,8,4.5,8.65,2,4.75,6.25,6,15.38,14.58,12.5,5.25,2.17,7.14,6.22,9,10,5.77,4,8.75,6.53,7.6,5,5,21.86,8.64,3.3,4.44,4.55,3.5,6.25,3.85,6.18,2.91,6.25,6.25,9.05,10,11.11,6.88,8.75,10,3.05,3,5.8,4.1,8,6.15,2.7,2.75,3,3,7.36,7.5,3.5,8.1,3.75,3.25,5.83,3.5,3.33,4,3.5,6.25,2.95,5.71,3,22.86,9,8.33,3,5.75,6.76,10,3,3.5,3.25,4,2.92,3.06,3.2,4.75,3,18.16,3.5,4.11,1.96,4.29,3,6.45,5.2,4.5,3.88,3.45,10.91,4.1,3,5.9,18,4,3,3.55,3,8.75,2.9,6.26,3.5,4.6,6,2.89,5.58,4,6,4.5,2.92,4.33,18.89,4.28,4.57,6.25,2.95,8.75,8.5,3.75,3.15,5,6.46,2,4.79,5.78,3.18,4.68,4.1,2.91,6,3.6,3.95,7,3,6.08,8.63,3,3.75,2.9,3,6.25,3.5,3,3.24,8.02,3.33,5.25,6.25,3.5,2.95,3,4.69,3.73,4,4,2.9,3.05,5.05,13.95,18.16,6.25,5.25,4.79,3.35,3,8.43,5.7,11.98,3.5,4.24,7,6,12.22,4.5,3,2.9,15,4,5.25,4,3.3,5.05,3.58,5,4.57,12.5,3.45,4.63,10,2.92,4.51,6.5,7.5,3.54,4.2,3.51,4.5,3.35,2.91,5.25,4.05,3.75,3.4,3,6.29,2.54,4.5,3.13,6.36,4.68,6.8,8.53,4.17,3.75,11.1,3.26,9.13,4.5,3,8.75,4.14,2.87,3.35,6.08,3,4.2,5.6,10,12.5,3.76,3.1,4.29,10.92,7.5,4.05,4.65,5,2.9,8,8.43,2.92,6.25,6.25,5.11,4,4.44,6.88,5.43,3,2.9,6.25,4.34,3.25,7.26,6.35,5.63,8.75,3.2,3,3,12.5,2.88,3.35,6.5,10.38,4.5,10,3.81,8.8,9.42,6.33,4,2.9,20,11.25,3.5,6,14.38,6.36,3.55,3,4.5,6.63,9.3,3,3.25,1.5,5.9,8,2.9,3.29,6.5,4,6,4.08,3.75,3.05,3.5,2.92,4.5,3.35,5.95,8,3,5,5.5,2.65,3,4.5,17.5,8.18,9.09,11.82,3.25,4.5,4.5,3.71,6.5,2.9,5.6,2.23,5,8.33,2.9,6.25,4.55,3.28,2.3,3.3,3.15,12.5,5.15,3.13,7.25,2.9,1.75,2.89,2.9,17.71,6.25,2.6,6.63,3.5,6.5,3,4.38,10,4.95,9,1.43,3.08,9.33,7.5,4.75,5.65,15,2.27,4.67,11.56,3.5],[11,12,11,8,12,16,18,12,12,17,16,13,12,12,12,16,12,13,12,12,12,12,16,12,11,16,16,16,15,8,14,14,13,12,12,16,12,4,14,12,12,12,14,11,13,15,10,12,14,12,12,16,12,12,12,15,16,8,18,16,13,14,10,10,14,14,16,12,16,12,16,17,12,12,12,13,12,12,12,18,9,16,10,12,12,12,12,12,8,12,12,14,12,12,12,9,13,12,14,12,15,12,12,12,14,15,12,12,12,17,11,18,12,14,14,10,14,12,15,8,16,14,15,12,18,16,10,8,10,11,18,15,12,11,12,12,14,16,2,14,16,12,12,13,12,15,10,12,16,13,9,12,13,12,12,14,16,16,9,18,10,10,13,12,18,13,12,13,13,13,18,12,12,13,12,12,12,14,10,12,16,16,12,14,12,12,12,12,12,12,12,16,16,14,11,16,12,12,17,12,12,16,8,12,12,12,16,12,12,9,13,16,14,8,14,13,12,18,9,8,8,12,14,12,16,8,13,9,16,12,15,11,14,12,12,12,18,12,12,12,12,12,12,14,16,12,14,11,12,10,12,6,13,12,10,12,14,13,12,18,12,12,12,12,12,8,13,13,14,12,10,16,12,16,12,14,18,17,13,14,15,14,12,8,12,12,8,12,9,12,16,12,16,12,12,13,10,6,12,12,16,12,8,12,6,4,11,11,7,12,18,12,16,12,14,12,10,10,9,10,12,12,12,10,16,16,16,12,12,7,8,16,16,18,13,10,16,14,16,12,9,11,11,12,11,12,12,12,12,14,14,18,12,12,12,11,12,17,16,13,13,12,14,14,11,10,8,14,12,10,17,9,12,12,14,16,12,10,0,14,15,16,12,11,11,12,13,12,13,16,15,16,15,12,18,6,6,12,12,16,9,12,11,10,12,8,9,17,16,11,10,8,13,14,13,11,7,16,12,13,14,16,14,11,8,14,17,10,12,12,18,14,18,12,16,14,12,9,12,12,17,12,15,17,16,12,15,16,12,15,12,12,12,12,16,11,14,14,13,14,12,12,8,12,3,11,15,11,12,4,9,12,12,11,12,16,13,15,16,12,12,12,9,10,12,11,8,6,16,12,12,16,12,10,13,13,14,16,10,12,12,11,0,5,16,16,9,15,12,12,12,13,12,7,17,12,12,14,12,13,12,16,10,15,16,14],[0,2,0,28,2,8,7,3,4,21,2,0,0,3,15,0,0,10,0,6,4,13,9,1,8,3,10,0,0,1,5,5,16,3,0,4,6,15,3,0,0,5,0,12,4,13,0,2,2,1,0,2,5,7,0,0,1,0,8,0,20,5,8,0,3,23,4,3,5,2,0,2,8,34,0,19,0,1,13,0,5,1,0,5,2,3,0,4,24,7,6,39,0,0,1,1,0,22,2,0,6,0,12,4,7,3,11,10,0,0,12,25,3,0,16,0,0,2,1,12,1,0,0,0,3,3,3,30,2,1,3,0,1,0,0,0,5,3,13,11,20,0,1,0,2,2,0,4,5,15,0,0,3,5,2,5,0,2,1,4,0,0,5,4,1,6,2,5,0,21,7,1,10,4,5,9,5,4,3,11,2,2,11,0,11,16,8,8,0,0,6,2,0,3,0,1,0,2,3,8,19,2,0,0,0,6,0,2,12,0,2,10,2,24,24,2,3,2,0,15,0,4,3,4,0,2,2,0,7,1,26,0,5,3,0,0,1,0,1,0,10,5,0,0,7,0,0,3,0,0,6,13,2,3,0,23,0,1,7,0,0,0,0,1,44,6,17,0,0,8,0,1,6,2,1,3,0,20,1,7,4,23,1,26,0,1,2,0,0,8,4,0,1,2,0,13,26,6,5,9,0,9,2,2,7,0,31,2,1,0,3,8,0,0,2,1,0,7,2,12,0,1,0,0,16,28,4,0,3,0,0,6,0,10,1,5,3,3,2,0,20,2,31,2,11,3,9,0,0,5,0,2,5,1,2,0,2,0,0,7,3,2,0,4,2,7,25,0,15,1,3,0,0,5,1,0,10,6,10,4,4,5,12,10,1,4,0,8,0,10,0,0,15,24,5,0,0,25,5,2,4,2,5,0,3,3,0,0,2,0,0,11,21,3,0,0,21,2,2,2,1,2,3,0,1,3,18,0,1,2,2,0,8,1,1,0,8,18,0,4,25,0,4,9,0,0,0,10,0,2,0,2,1,7,4,0,13,33,0,0,17,2,24,20,30,9,1,9,6,0,9,4,10,0,14,22,5,12,13,0,0,0,7,11,1,8,3,0,2,1,6,2,2,0,0,0,30,21,1,5,2,1,0,0,3,3,0,3,3,14,1,0,0,17,0,0,1,11,5,1,0,2,0,18,1,4],[1,1,0,0,0,0,0,1,1,0,1,1,0,0,0,0,1,1,1,1,1,0,1,1,1,0,0,0,0,1,0,1,1,1,1,1,1,0,1,1,1,1,1,1,1,0,0,1,0,1,1,1,0,0,0,0,1,0,1,0,1,0,0,1,0,0,1,0,0,1,1,0,0,1,1,0,1,1,0,0,0,0,1,0,1,1,1,1,0,0,0,0,1,1,1,1,0,0,0,0,1,1,1,1,0,1,0,0,1,0,0,0,0,0,1,0,1,0,0,1,1,1,0,1,0,1,1,0,1,0,0,0,0,1,0,1,0,0,0,1,0,0,1,1,1,1,1,1,1,1,0,0,0,1,1,0,0,0,1,0,0,1,0,1,1,1,1,1,0,0,1,0,1,1,1,1,1,0,0,1,0,1,1,0,1,0,0,0,0,0,1,0,0,1,1,1,1,0,0,0,0,0,0,1,1,1,1,0,1,1,1,1,1,0,0,0,1,0,0,1,0,1,1,1,0,1,1,1,0,0,0,0,0,0,0,0,1,0,1,1,1,1,0,1,0,1,0,1,1,0,0,0,1,0,1,0,1,0,0,1,0,0,1,1,0,1,1,1,1,0,1,0,1,0,1,0,0,0,1,1,1,0,0,0,1,0,0,0,0,0,0,1,1,1,0,0,1,0,0,1,0,0,0,1,0,0,0,1,1,1,1,1,0,0,0,0,1,0,1,0,1,1,1,1,0,0,0,0,1,1,0,0,0,0,1,0,1,1,0,0,1,0,0,0,0,0,1,1,0,0,0,0,1,1,0,1,0,0,0,1,1,1,0,1,0,0,0,1,1,1,0,1,1,1,0,0,0,0,1,1,1,1,0,0,1,0,1,1,1,0,0,1,1,0,0,0,1,1,0,1,1,0,0,1,1,0,1,1,1,0,1,0,0,0,1,1,0,1,1,0,1,0,0,0,1,1,0,1,0,0,0,0,0,1,0,1,0,0,1,0,0,1,1,0,0,1,0,0,1,0,1,1,1,1,0,1,0,0,1,0,1,1,1,1,0,0,1,0,0,0,0,0,1,1,1,0,0,0,0,1,0,0,0,0,1,0,0,1,0,1,0,0,1,1,1,1,0,1,0,0,1,0,1,1,0,0,0,1,1,0,1,0,0,0,1,1,0,0,0,0,0,1,1,0,0,1],[0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,1,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,1,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,1,0,0,0,0,0,0,0,1,0,0,0,0,1,0,1,0,0,0,0,0,0,1,0,0,0,1,1,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,1,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,1,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,1,0,0,0,1,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1]],"container":"<table class=\"display\">\n  <thead>\n    <tr>\n      <th> <\/th>\n      <th><span style=\"color: #007935 !important\">Wage<\/span><\/th>\n      <th><span style=\"color: #007935 !important\">Education<\/span><\/th>\n      <th><span style=\"color: #007935 !important\">Tenure<\/span><\/th>\n      <th><span style=\"color: #007935 !important\">Female?<\/span><\/th>\n      <th><span style=\"color: #007935 !important\">Non-white?<\/span><\/th>\n    <\/tr>\n  <\/thead>\n<\/table>","options":{"pageLength":12,"lengthChange":false,"searching":false,"columnDefs":[{"className":"dt-right","targets":[1,2,3,4,5]},{"orderable":false,"targets":0}],"order":[],"autoWidth":false,"orderClasses":false,"rowCallback":"function(row, data, displayNum, displayIndex, dataIndex) {\nvar value=data[1]; $(this.api().cell(row, 1).node()).css({'color':'#9370DB'});\nvar value=data[2]; $(this.api().cell(row, 2).node()).css({'color':'#9370DB'});\nvar value=data[3]; $(this.api().cell(row, 3).node()).css({'color':'#9370DB'});\nvar value=data[4]; $(this.api().cell(row, 4).node()).css({'color':'#9370DB'});\nvar value=data[5]; $(this.api().cell(row, 5).node()).css({'color':'#9370DB'});\nvar value=data[0]; $(this.api().cell(row, 0).node()).css({'color':'#FD5F00'});\n}"}},"evals":["options.rowCallback"],"jsHooks":[]}</script>
]
]

---
# But What Can We Do With Data?

--
.font90[

- We summarized it! (see the great [short story by J.L. Borges](https://tinyurl.com/yx2n5xon) on why summarizing is essential)

.pull-left[

- One of the first thing we do when summarizing data is to look at *some type of average*.

- Wait? *Type* of average? Isn't there just one average? called *the mean*?

]

---
count:true
background-image: url("Images/average_def.png")
background-size: 50%
background-position: 100% 40%

# But What Can We Do With Data?

.font90[

- We summarized it! (see the great [short story by J.L. Borges](https://tinyurl.com/yx2n5xon) on why summarizing is essential)

.pull-left[

- One of the first thing we do when summarizing data is to look at *some type of average*.

- Wait? *Type* of average? Isn't there just one average? called *the mean*?

- These is also referred as measure of central tendency. 
- In this course, we will focus primarily on the mean. **From now on in this course mean = average**.

]

---
# Mean

-  The mean is defined by the sum of a set of values divided by the number of values.

Let’s look at the mean from the "hang out with a friend" exercise.

- Total over N

$$
`\begin{equation}
Average(X) = \frac{ 1 \times 10 + 
                    2 \times 9 + 
                    3 \times 11 }{30} =
                    \color{#9370DB}{2.03}
\end{equation}`
$$

- One number, **highly informative** for a variable of interest.
- Always important to keep an eye on the units and magnitude (relevant for PS1).

---
# Mean of a Binary Variable

- The interpretation for the mean of a binary variable is different from the case when there are more than two values.

- Above, the interpretation of `$Average(X) = 2.03$` can be read as "close to having an OK time with a friend".

- But when variables only take two values, and we assing those values to be 0 and 1, the interpretation of the mean is "the proportion of all the cases where the variable takes the value of one".

- Think of the the variable `hispanic` for students in this classroom (1 if identifies as hispanic, 0 otherwise).

---
count:true
# Mean: .font80[Notation (Message to me: draw histogram on the board)]
--

.font90[

$$
`\begin{equation}
Average(X) = \frac{ 1 \times 10 + 
                    2 \times 9 + 
                    3 \times 11 }{30} =
                    \color{#9370DB}{2.03}\\
\end{equation}`
$$

$$
`\begin{equation}
Ave(X) = 1 \times \frac{10}{30} + 
         2 \times \frac{9}{30} + 
         3 \times \frac{11}{30}   =
                    \color{#9370DB}{2.03}\\
\end{equation}`
$$

]

---
count:true
# Mean: Notation

.font90[
$$
`\begin{equation}
Average(X) = \frac{ 1 \times 10 + 
                    2 \times 9 + 
                    3 \times 11 }{30} =
                    \color{#9370DB}{2.03}\\
\end{equation}`
$$

$$
`\begin{equation}
Ave(X) = \color{#FD5F00}{1} \times \color{#007935}{\frac{10}{30}} + 
         \color{#FD5F00}{2} \times \color{#007935}{\frac{9}{30}} + 
         \color{#FD5F00}{3} \times \color{#007935}{\frac{11}{30} }  =
                    \color{#9370DB}{2.03}\\
\end{equation}`
$$
]
--
.font90[
$$
`\begin{equation}
\overline{X}_{n} = \color{#FD5F00}{x_{1}} \times \color{#007935}{proportion(x_{1})} + 
         \color{#FD5F00}{x_{2}} \times \color{#007935}{proportion(x_{2})} + 
         \color{#FD5F00}{x_{3}} \times \color{#007935}{proportion(x_{3})}\\ 
\end{equation}`
$$
]

--
.font90[
$$
`\begin{equation}
\overline{X}_{n} = \text{summing across all } x \left(  \color{#FD5F00}{x} \times \color{#007935}{proportion_{n}(x)} \right)\\ 
\end{equation}`
$$
]

--
.font90[
$$
`\begin{equation}
\overline{X}_{n} = \sum_{x}  \color{#FD5F00}{x} \times \color{#007935}{prop_{n}(x)}\\ 
\end{equation}`
$$

]

---
count:true
# Mean: Notation

.font90[
$$
`\begin{equation}
\overline{X}_{n} = \color{#FD5F00}{x_{1}} \times \color{#007935}{proportion(x_{1})} + 
         \color{#FD5F00}{x_{2}} \times \color{#007935}{proportion(x_{2})} + 
         \color{#FD5F00}{x_{3}} \times \color{#007935}{proportion(x_{3})}\\ 
\end{equation}`
$$
]

.font90[
$$
`\begin{equation}
\overline{X}_{n} = \text{summing across all } x \left(  \color{#FD5F00}{x} \times \color{#007935}{proportion_{n}(x)} \right)\\ 
\end{equation}`
$$
]

.font90[
$$
`\begin{equation}
\overline{X}_{n} = \sum_{x}  \color{#FD5F00}{x} \times \color{#007935}{prop_{n}(x)}\\ 
\end{equation}`
$$

]

---
count:true

# Expected Value

- Let’s look at the histogram for the exercise above (drawn in the board) and pretend it is not a sample but the entire population. How can we move from frequencies into probabilities?

- Replace frequencies by probabilities

- The population version of the sample mean is the **expected value**.

---
# Expected Value: Definition (Discrete)

The expected value of a discrete random variable `$X$` is the weighted average of its `$k$` values `$\{x_1, \dots, x_k\}$` and their associated probabilities:

$$
`\begin{aligned}
\mathop{\mathbb{E}}(X) &= x_1 \mathop{\mathbb{P}}(X = x_1) + x_2 \mathop{\mathbb{P}}(X = x_2) + \dots +x_k \mathop{\mathbb{P}}(X = x_N) \\
&= \sum_{x} x\mathop{\mathbb{P}}(X = x)
\end{aligned}`
$$

- Also known as the .hi[population mean].

---
# Expected Value: Definition (Discrete)

The expected value of a discrete random variable `$X$` is the weighted average of its `$k$` values `$\{x_1, \dots, x_k\}$` and their associated probabilities:

$$
`\begin{aligned}
\mathop{\mathbb{E}}(X) &= x_1 \mathop{\mathbb{P}}(X = x_1) + x_2 \mathop{\mathbb{P}}(X = x_2) + \dots +x_k \mathop{\mathbb{P}}(X = x_k) \\
&= \sum_{x} \color{#FD5F00}{x} \color{#007935}{\mathop{\mathbb{P}}(X = x)} = \sum_{x} \color{#FD5F00}{x} \color{#007935}{f(x)}
\end{aligned}`
$$

- Also known as the .hi[population mean]. Compare it to the sample mean:

$$
`\begin{equation}
\overline{X}_{n} = \sum_{x}  \color{#FD5F00}{x} \times \color{#007935}{prop_{n}(x_{1})}\\ 
\end{equation}`
$$

---
# Expected Value

## Example

Rolling a six-sided die once can take values `$\{1, 2, 3, 4, 5, 6\}$`, each with equal probability. .hi-purple[What is the expected value of a roll?]

`$\mathop{\mathbb{E}}(\text{Roll}) = 1 \times \frac{1}{6} + 2 \times \frac{1}{6} + 3 \times \frac{1}{6} + 4 \times \frac{1}{6} + 5 \times \frac{1}{6} + 6 \times \frac{1}{6} = \color{#9370DB}{3.5}$`.

- __Note:__ The expected value can be a number that isn't a possible outcome of `$X$`.

---
# Expected Value. Definition (Continuous)

.pull-left[

If `$X$` is a continuous random variable and `$f(x)$` is its probability density function, then the expected value of `$X$` is

$$
\mathop{\mathbb{E}}(X) = \int_{-\infty}^{\infty} x f(x) dx.
$$

- __Note:__ `$x$` represents the particular values of `$X$`.

- Same idea as the discrete definition: describes the .hi[population mean].

]

.pull-right[

<img src="03_exp_sd_files/figure-html/unnamed-chunk-3-1.svg" style="display: block; margin: auto;" />
]

---
count:true
# Expected Value. Definition (Continuous)
- Compare it to the discrete version

- Continuous
$$
\mathop{\mathbb{E}}(X) = \int_{-\infty}^{\infty} x f(x) dx.
$$

- Discrete
$$
\mathop{\mathbb{E}}(X) = \sum_{x} \color{#FD5F00}{x} \color{#007935}{f(x)}
$$

---
# Expected Value. Definition (Continuous)
- Compare it to the discrete version

- Continuous
$$
\mathop{\mathbb{E}}(X) = \color{#9370DB}{\int_{-\infty}^{\infty}} \color{#FD5F00}{x} \color{#007935}{f(x)}  \color{#9370DB}{dx}.
$$

- Discrete
$$
\mathop{\mathbb{E}}(X) = \color{#9370DB}{\sum_{x}} \color{#FD5F00}{x} \color{#007935}{f(x)}
$$

.right[
This explanation was inspired by  
[this lecture from Eddie Woo](https://youtu.be/tF2Kns7RrfQ)
]

---
# Expected Value. Definition. One Last Thing 1/2

Let's go back to the mean of our exercise:

$$
`\begin{equation}
\overline{X}_{n} = \color{#FD5F00}{1} \times \color{#007935}{\frac{10}{30}} + 
         \color{#FD5F00}{2} \times \color{#007935}{\frac{9}{30}} + 
         \color{#FD5F00}{3} \times \color{#007935}{\frac{11}{30} }  =
                    \color{#9370DB}{2.03}\\
\end{equation}`
$$

But now let's switch the values of the random variables to: 10, 20, 30. How should we compute the mean?

--
$$
`\begin{equation}
\overline{g(X)}_{n} = \color{#FD5F00}{10} \times \color{#007935}{\frac{10}{30}} + 
         \color{#FD5F00}{20} \times \color{#007935}{\frac{9}{30}} + 
         \color{#FD5F00}{30} \times \color{#007935}{\frac{11}{30} }  =
                    \color{#9370DB}{20.33}\\
\end{equation}`
$$

---
# Expected Value. Definition. One Last Thing 2/2

Hence, we can conclude, that for a random variable `$X$`, any transformation `$g(X)$` has a sample aveage:

$$
`\begin{equation}
\overline{X}_{n} = \sum_{x}  \color{#FD5F00}{g(x)} \times \color{#007935}{prop_{n}(x_{1})}\\ 
\end{equation}`
$$

And an expectation:

$$
\mathop{\mathbb{E}}(g(X)) = \color{#9370DB}{\sum_{x}} \color{#FD5F00}{g(x)} \color{#007935}{f(x)}
$$

The same idea applies in the case of a continues random variable

---
# Expected Value: Rules (or Properties)

## Rule 1

For any constant `$c$`, `$\mathop{\mathbb{E}}(c) = c$`.

## Not-so-exciting examples

`$\mathop{\mathbb{E}}(5) = 5$`.

`$\mathop{\mathbb{E}}(1) = 1$`.

`$\mathop{\mathbb{E}}(4700) = 4700$`.

---
# Expected Value

## Rule 2

For any constants `$a$` and `$b$`, `$\mathop{\mathbb{E}}(aX + b) = a\mathop{\mathbb{E}}(X) + b$`.

## Example

Suppose `$X$` is the high temperature in degrees Celsius in Eugene during August. The long-run average is `$\mathop{\mathbb{E}}(X) = 28$`. If `$Y$` is the temperature in degrees Fahrenheit, then `$Y = 32 + \frac{9}{5} X$`. .hi-purple[What is] `$\color{#9370DB}{\mathop{\mathbb{E}}(Y)}$`.hi-purple[?]

- `$\mathop{\mathbb{E}}(Y) = 32 + \frac{9}{5} \mathop{\mathbb{E}}(X) = 32 + \frac{9}{5} \times 28 = \color{#9370DB}{82.4}$`.

---
# Expected Value

## Rule 3: Linearity

If `$\{a_1, a_2, \dots , a_n\}$` are constants and `$\{X_1, X_2, \dots , X_n\}$` are random variables, then

$$
\color{#FD5F00}{\mathop{\mathbb{E}}(a_1 X_1 + a_2 X_2 + \dots + a_n X_n)} = \color{#007935}{a_1 \mathop{\mathbb{E}}(X_1) + a_2 \mathop{\mathbb{E}}(X_2) + \dots + a_n \mathop{\mathbb{E}}(X_n)}.
$$

In English, .hi-orange[the expected value of the sum] .mono[=] .hi-green[the sum of expected values].

---
# Expected Value

## Rule 3

.hi-orange[The expected value of the sum] .mono[=] .hi-green[the sum of expected values].

## Example

Suppose that a coffee shop sells `$X_1$` small, `$X_2$` medium, and `$X_3$` large caffeinated beverages in a day. The quantities sold are random with expected values `$\mathop{\mathbb{E}}(X_1) = 43$`, `$\mathop{\mathbb{E}}(X_2) = 56$`, and `$\mathop{\mathbb{E}}(X_3) = 21$`. The prices of small, medium, and large beverages are `$1.75$`, `$2.50$`, and `$3.25$` dollars. .hi-purple[What is expected revenue?]

$$
`\begin{aligned}
\color{#FD5F00}{\mathop{\mathbb{E}}(1.75 X_1 + 2.50 X_2 + 3.35 X_n)} &= \color{#007935}{1.75 \mathop{\mathbb{E}}(X_1) + 2.50 \mathop{\mathbb{E}}(X_2) + 3.25 \mathop{\mathbb{E}}(X_3)} \\
&= \color{#9370DB}{1.75(43) + 2.50(56) + 3.25(21)} \\
&= \color{#9370DB}{283.5}
\end{aligned}`
$$

---
# Expected Value

## __Caution__

Previously, we found that the expected value of rolling a six-sided die is `$\mathop{\mathbb{E}} \left(\text{Roll} \right) = 3.5$`.

- If we square this number, we get `$\left[\mathop{\mathbb{E}} ( \text{Roll} ) \right]^2 = 12.25$`.

__Is__ `$\left[\mathop{\mathbb{E}} \left( \text{Roll} \right) \right]^2$` __the same as__ `$\mathop{\mathbb{E}} \left(\text{Roll}^2 \right)$`__?__

__No!__

$$
`\begin{aligned}
\mathop{\mathbb{E}} \left( \text{Roll}^2 \right) &= 1^2 \times \frac{1}{6} + 2^2 \times \frac{1}{6} + 3^2 \times \frac{1}{6} + 4^2 \times \frac{1}{6} + 5^2 \times \frac{1}{6} + 6^2 \times \frac{1}{6} \\ &\approx 15.167 \\ &\neq 12.25.
\end{aligned}`
$$

---
# Expected Value

## __Caution__

Except in special cases, .hi-purple[the transformation of an expected value] __is not__ .hi-green[the expected value of a transformed random variable].

For some function `$g(\cdot)$`, it is typically the case that

`$$\color{#9370DB}{g \left( \mathop{\mathbb{E}}(X) \right)} \neq \color{#007935}{\mathop{\mathbb{E}} \left( g(X) \right)}.$$`

---

# Activity 1
 - Let's watch [another Stat 110's video](https://youtu.be/sheoa3TrcCI). Then get together in groups of 3 and discuss:
    - Don't worry about the law of large numbers yet
    - How does the random variables becomes continuous?
    - How does linearity help with computations?