{"id":544,"date":"2015-08-01T12:51:23","date_gmt":"2015-08-01T11:51:23","guid":{"rendered":"http:\/\/pcool.dyndns.org:8080\/statsbook\/?page_id=544"},"modified":"2025-07-01T10:53:07","modified_gmt":"2025-07-01T09:53:07","slug":"regression-plot","status":"publish","type":"page","link":"https:\/\/pcool.dyndns.org\/index.php\/regression-plot\/","title":{"rendered":"Regression Plot"},"content":{"rendered":"\n<p>Essentially, a regression plot is a scatter plot with a fitted regression line. Regression lines could be linear, quadratic, and polynomial amongst others. The example below demonstrates how to create a linear regression plot for Anscombe&#8217;s first data set. Download the <a href=\"http:\/\/http2:\/\/pcool.dyndns.org:\/wp-content\/data_files\/anscombe.rda\" target=\"_blank\" rel=\"noreferrer noopener\">anscombe.rda<\/a> dataset for this example<sup class='sup-ref-note' id='note-zotero-ref-p544-r1-o1'><a class='sup-ref-note' href='#zotero-ref-p544-r1'>1<\/a><\/sup>.<\/p>\n\n\n\n<p>Create a <a href=\"https:\/\/pcool.dyndns.org\/index.php\/scatterplot\/\" data-type=\"page\" data-id=\"541\">scatterplot<\/a> as discussed:<\/p>\n\n\n\n<p>Anscombe\u2019s fictional data sets can be shown by:<\/p>\n\n\n\n<pre class=\"wp-block-code has-small-font-size\"><code><em><mark style=\"background-color:rgba(0, 0, 0, 0);color:#f80202\" class=\"has-inline-color\">anscombe.quartet\n<\/mark><mark style=\"background-color:rgba(0, 0, 0, 0);color:#1902f7\" class=\"has-inline-color\">   X1    Y1 X2   Y2 X3    Y3 X4    Y4\n1  10  8.04 10 9.14 10  7.46  8  6.58\n2   8  6.95  8 8.14  8  6.77  8  5.76\n3  13  7.58 13 8.74 13 12.74  8  7.71\n4   9  8.81  9 8.77  9  7.11  8  8.84\n5  11  8.33 11 9.26 11  7.81  8  8.47\n6  14  9.96 14 8.10 14  8.84  8  7.04\n7   6  7.24  6 6.13  6  6.08  8  5.25\n8   4  4.26  4 3.10  4  5.39 19 12.50\n9  12 10.84 12 9.13 12  8.15  8  5.56\n10  7  4.82  7 7.26  7  6.42  8  7.91\n11  5  5.68  5 4.74  5  5.73  8  6.89<\/mark><\/em><\/code><\/pre>\n\n\n\n<p>The first data set has X1 on the x -axis and Y1 on the y-axis. To create a scatterplot:<\/p>\n\n\n\n<pre class=\"wp-block-code has-small-font-size\"><code><span style=\"color: #ff0000;\"><em>regressionplot &lt;- ggplot(<span style=\"color: #ff0000;\"><em>data=anscombe.quartet,<\/em><\/span><\/em><\/span> <span style=\"color: #ff0000;\"><em>aes(x = X1,y = Y1)) +<\/em><\/span>\n<span style=\"color: #ff0000;\"><em>geom_point() +<\/em><\/span>\n<span style=\"color: #ff0000;\"><em>ggtitle(label = \"Anscombe \\'s First Data Set\") +<\/em><\/span>\n<span style=\"color: #ff0000;\"><em>theme_bw()<\/em><\/span><\/code><\/pre>\n\n\n\n<p class=\"is-style-text-annotation is-style-text-annotation--1\">The backslash \\ before the \u2018s is required so the quotation mark does not indicate the end of the title\u2019s text string, but that the quotation mark is part of the title!<\/p>\n\n\n\n<p>To add a regression line with a 95% confidence interval:<\/p>\n\n\n\n<pre class=\"wp-block-code has-small-font-size\"><code><span style=\"color: #ff0000;\">regressionplot &lt;- regressionplot + <\/span>\n<span style=\"color: #ff0000;\">geom_smooth(aes(x=X1, y=Y1), data=anscombe.quartet, method = 'lm')<\/span>\n<span style=\"color: #ff0000;\">regressionplot<\/span><\/code><\/pre>\n\n\n\n<p>Will show the plot:<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"768\" src=\"https:\/\/pcool.dyndns.org\/wp-content\/uploads\/2025\/06\/regressionplot1-1024x768.png\" alt=\"\" class=\"wp-image-3603\" srcset=\"https:\/\/pcool.dyndns.org\/wp-content\/uploads\/2025\/06\/regressionplot1-1024x768.png 1024w, https:\/\/pcool.dyndns.org\/wp-content\/uploads\/2025\/06\/regressionplot1-300x225.png 300w, https:\/\/pcool.dyndns.org\/wp-content\/uploads\/2025\/06\/regressionplot1-768x576.png 768w, https:\/\/pcool.dyndns.org\/wp-content\/uploads\/2025\/06\/regressionplot1.png 1355w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>Or without a 95% confidence interval:<\/p>\n\n\n\n<pre class=\"wp-block-code has-small-font-size\"><code><span style=\"color: #ff0000;\"><em>regressionplot2 &lt;- ggplot(<span style=\"color: #ff0000;\"><em>data=anscombe.quartet,<\/em><\/span> aes(x = X1,y = Y1)) +<\/em><\/span>\n<span style=\"color: #ff0000;\"><em>geom_point() + <\/em><\/span>\n<span style=\"color: #ff0000;\"><em>geom_smooth(method = 'lm', se = FALSE) +<\/em><\/span>\n<span style=\"color: #ff0000;\"><em>ggtitle(label = \"Anscombe \\'s First Data Set\") +<\/em><\/span>\n<span style=\"color: #ff0000;\"><em>theme_bw()<\/em><\/span>\n<em><span style=\"color: #ff0000;\">regressionplot2<\/span><\/em><\/code><\/pre>\n\n\n\n<p>Will show the plot:<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"768\" src=\"https:\/\/pcool.dyndns.org\/wp-content\/uploads\/2025\/06\/regressionplot2-1024x768.png\" alt=\"\" class=\"wp-image-3604\" srcset=\"https:\/\/pcool.dyndns.org\/wp-content\/uploads\/2025\/06\/regressionplot2-1024x768.png 1024w, https:\/\/pcool.dyndns.org\/wp-content\/uploads\/2025\/06\/regressionplot2-300x225.png 300w, https:\/\/pcool.dyndns.org\/wp-content\/uploads\/2025\/06\/regressionplot2-768x576.png 768w, https:\/\/pcool.dyndns.org\/wp-content\/uploads\/2025\/06\/regressionplot2.png 1355w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>It is customary to put the independent (explanatory or predictor) variable on the x-axis (abscissa) and the dependent (response or outcome) variable on the y-axis (ordinate). However, it is not always clear which variable is dependent and which independent.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Essentially, a regression plot is a scatter plot with a fitted regression line. Regression lines could be linear, quadratic, and polynomial amongst others. The example below demonstrates how to create a linear regression plot for Anscombe&#8217;s first data set. Download the anscombe.rda dataset for this example. Create a scatterplot as discussed: Anscombe\u2019s fictional data sets [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"inline_featured_image":false,"footnotes":""},"class_list":["post-544","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/pcool.dyndns.org\/index.php\/wp-json\/wp\/v2\/pages\/544","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pcool.dyndns.org\/index.php\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/pcool.dyndns.org\/index.php\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/pcool.dyndns.org\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/pcool.dyndns.org\/index.php\/wp-json\/wp\/v2\/comments?post=544"}],"version-history":[{"count":3,"href":"https:\/\/pcool.dyndns.org\/index.php\/wp-json\/wp\/v2\/pages\/544\/revisions"}],"predecessor-version":[{"id":4694,"href":"https:\/\/pcool.dyndns.org\/index.php\/wp-json\/wp\/v2\/pages\/544\/revisions\/4694"}],"wp:attachment":[{"href":"https:\/\/pcool.dyndns.org\/index.php\/wp-json\/wp\/v2\/media?parent=544"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}