<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-7444616934468721198</id><updated>2012-02-16T05:00:35.614-08:00</updated><category term='resume'/><title type='text'>Paul Clayton</title><subtitle type='html'></subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://rainriversea.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7444616934468721198/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://rainriversea.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Paul Clayton</name><uri>http://www.blogger.com/profile/12852763566980520419</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://3.bp.blogspot.com/_FdlS-DRoyac/SSoKH9aDk0I/AAAAAAAAAAM/dwbDlKOhZ50/S220/IMG_1305.jpg'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>4</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-7444616934468721198.post-2035154773964653910</id><published>2008-11-26T23:34:00.001-08:00</published><updated>2008-11-28T00:24:51.317-08:00</updated><title type='text'>R-values 1, gut instinct 0</title><content type='html'>I wrote the following paper in 2007 while studying econometrics with Professor Dick Startz.  Statistical methods can be powerful tools, notwithstanding the old line about lies, damn lies, and statistics.  The analysis below is just the tip of the iceberg in how data can be used in politics.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; text-decoration: underline;"&gt;Improving Voter Targeting by Political Campaigns&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Executive Summary&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Political campaigns use a variety of methods to target the voters they want to contact.  During the 2006 election cycle I worked with the Darcy Burner campaign in a support role, and saw how they used various types of data to choose which voters to pursue.  It was striking to me that statistical analysis and methodological sampling were not part of the plan.  The campaign relied instead on a large army of volunteers to contact as many voters as possible within a fairly wide range of potential contacts.&lt;br /&gt;&lt;span id="fullpost"&gt;&lt;br /&gt;I have taken a look in hindsight at how well the prediction variables performed.  I investigated two primary areas, the likelihood of a voter to return a ballot, and the likelihood that a voter would be a supporter.  I regressed ballot returns on past vote history, and found a good correlation.  The campaign correctly used past vote history as a predictor, although my model improves the predictive value by assigning weights to each past election.  Three variables the campaign relied on to determine support included age, sex, and “ST score”.  I found that there was a statistically significant correlation between each of these variables and the average support level for the candidate, which is what the campaign would hope for.  However, the overall amount of variability in support explained by these three variables is quite low, with an R Squared of only 0.018.  This leaves enormous room for improvement in the future by campaigns who are willing and able to apply statistical methods to the data they are collecting.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Introduction &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Identification of likely supporters is a critical component of running a successful political campaign.  The identification effort runs for months or even years in advance of an election, and culminates in a massive effort to encourage supporters to get to the polls and vote on the day of the election.  In political lingo, this is known as GOTV (get out the vote.)  It is often the effectiveness of the GOTV effort that determines the victor in a close race.  During the 2006 election cycle, one of the closest and most contested races in the nation was here in Washington in the 8th Congressional District, pitting incumbent Dave Reichert ( R) against Darcy Burner (D).  The final result was a win for Reichert, who garnered 51.5% of the vote.  These two candidates are slated for a rematch in the 2008 election, and the race is widely expected to be extremely close once again.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Methodology and Definitions:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Data transformation was performed on the data, assigning numeric values to text fields for easier analysis.  Each voter in the database has several pieces of corresponding data.  The simplest includes basic demographic information such as age and sex.  Each voter also has a complete vote history for the past 6 years, for both primary and general elections.  All vote history data is entered as a 1 or 0, where 1 indicates the voter turned in a ballot and 0 is no ballot returned.  The “ST score” is a score from 0 to 100 which was provided by an outside vendor, based upon a variety of proprietary demographic information.  The “Pre 2006 Grade” is a ranking from 1 to 5 which tracks the partisan affiliation of a voter based on voter contact prior to the 2006 election.  These grades range from 1 to 5, with a 1 indicating a partisan Republican, a 5 being a partisan Democrat, 3 is Independent, and 2 or 4 indicate someone leaning toward a party.  “Cantwell ID” is a ranking from 1 to 5, with 1 indicating a strong opponent and 5 a strong supporter of Senator Maria Cantwell.  Sex was converted to a numeric data field also, setting female=0 and male=1.&lt;br /&gt;All regressions were run on the same random sample of 45,097 voters from the 8th Congressional District.  This reduces by 5,000 voters the number sampled in my initial investigation.  These voters were removed from the random overall sample of 50,000 due to not having been assigned an “ST Score” from the original vendor, thus a different scoring methodology was used on this subset and these voters should not have been included in my initial analysis.  Also, my initial investigation only had ST Score data grouped into broad categories, whereas this improved analysis uses raw data scored from 0 to 100 for each individual voter.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Goals:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;· Create a model to help campaigns identify groups of voters who are more likely to be supporters and to eliminate groups of voters who are likely opponents. &lt;br /&gt;· Investigate the relationship between past vote history and whether a voter returned a ballot during the 2006 general election.&lt;br /&gt;· Look at how accurate the “ST score” was in terms of predicting whether a voter would actually be a supporter of Darcy Burner.&lt;br /&gt;· Look at age and sex as possible factors in determining support for Darcy Burner.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Results&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;First, I calculated the average support level for the sample.  The average of 3.36 is a bit above 3, which is the undecided level.(1)  This does not mean the average support level across all voters is actually greater than 3 however, since the support level is likely skewed upwards.  This is due to the effort by the campaign to identify supporters, and their likely exclusion of groups of likely opponents from voter contact efforts.  Voter contact is a time consuming and expensive process, and only a fraction of eligible voters ever wind up with a known support level.  As the included number of observations shows, only 6980 voters in this sample of 50,000 voters were tagged with a support level.&lt;br /&gt;&lt;br /&gt;Next, I looked at the predictive value of the past party affiliation data.(2)  A statistically significant positive correlation exists between voter grade and support level.  The voter grade from prior to 2006 predicts 16% of the variation in the level of candidate support for Senator Cantwell.  This shows that while voter grade can be helpful in identifying supporters, it is only a small piece of the puzzle and cannot be relied upon exclusively.&lt;br /&gt;&lt;br /&gt;The next set of data I was interested in exploring were three variables which were used together during the campaign to search for potential supporters- Age, sex, and the ST score.(3)  While all of these variables are statistically significant, the overall R-square of .018 is very low.  This is slightly increased from .015 from my initial investigation.  I definitely expected to see a bigger improvement as I moved from the roughly categorized scores I used initially to the specific scores here, which are stated to two decimal places!  Thus, the use of these predictors only marginally improves on finding supporters over a purely random sample.  There does appear to be a higher level of support for Darcy Burner amongst women (sex=0, coefficient=1.87) than amongst men (sex=1, coefficient=1.63).  The campaign did work under the assumption that Burner was likely to find support amongst women rather than men, given she was up against a male opponent, and this assumption appears correct.&lt;br /&gt;&lt;br /&gt;Campaigns often focus their GOTV efforts on voters who are in the middle range of turnout likelihood.  It is assumed that it is wasted money to try to contact someone who is very unlikely to vote, and likewise someone who is guaranteed to vote.  The standard predictors used in making a determination of likelihood to vote are the recorded vote history.  It was assumed that general election turnout was a more important indicator than primary turnout, particularly since the focus was on the upcoming general election.  In hindsight, we can regress the general election turnout on the complete vote history for the previous 6 years worth of voting to determine the estimated coefficients for each election and use this to estimate the likelihood to turnout.(4)  The data confirms that general elections tend to be more significant than primaries overall, and that more recent elections are more significant than older elections.  Thus, the general model the campaign was using was correct, although it could be more finely tuned by using this new model to assign real weights to each election.  Notably, the R squared is .42, which intuitively seems about correct.  It would be unlikely to find a model that could predict with near certainty whether an individual voter would actually vote, but this model does provide some good guidance in terms of general trends.&lt;br /&gt;&lt;br /&gt;In conclusion, the results of this analysis show that the campaign was making decent educated guesses about what factors to look at when trying to direct their efforts.  The piece of information they were lacking though was how significant each variable was.  Without knowing how to weigh age versus sex, or ST score, the campaign was left making important decisions based on gut instinct.  Moreover, they were not aware how well their predictor variables were performing, and thus had no incentive to search for more significant variables.  It is very possible that a concerted use of regression analysis along with a careful plan to sample cross sections of the voting populace would have led to more efficient use of resources.  For example, the campaign made an effort to target young women.  Looking at the regression of support on age, expanded to show all ages by year, it appears the support level for younger women is actually lower than that of older women.  (5)  The sample size is small enough that the standard errors for these coefficients introduce some level of uncertainty with this result, but the data does seem to indicate a real trend here.  I removed 19 year olds and those over 93 due to high standard errors and created a scatter plot to illustrate this trend, with a fitted regression line.  It becomes quite clear now that targeting younger women may not have been an optimal campaign strategy.&lt;br /&gt;&lt;br /&gt;This scatter plot illustrates how regression analysis could have improved voter targeting and potentially changed the outcome of a Congressional race, by focusing on older women rather than younger women.  Last, to return to a previous point, the ST Score which was given great credence during the campaign cycle explained less than 2 % of the variation in a voter’s support level.  The fact that the ST Score was reported to two decimal places creates an impression that it is extremely precise, and that it can be used in a very fine manner to target voters.  In reality, there was almost no difference between broad categories of the ST Score and the very fine scoring used in my second analysis.  This suggests that campaign directors should take modeling scores with a grain of salt at best, at least until a real world verification of their effectiveness has been performed.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Data Output&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;1. ls us_house_of_represen c&lt;br /&gt;&lt;br /&gt;Dependent Variable: US_HOUSE_OF_REPRESEN&lt;br /&gt;Method: Least Squares&lt;br /&gt;Sample: 1 50000 IF STRATTELEMETRY_SCORE&gt;0&lt;br /&gt;Included observations: 6980&lt;br /&gt;&lt;br /&gt; Coefficient Std. Error t-Statistic Prob.&lt;br /&gt; C 3.359026 0.019662 170.8387 0.0000&lt;br /&gt;&lt;br /&gt;R-squared 0.000000     Mean dependent var 3.359026&lt;br /&gt;Adjusted R-squared 0.000000     S.D. dependent var 1.642687&lt;br /&gt;S.E. of regression 1.642687     Akaike info criterion 3.830687&lt;br /&gt;Sum squared resid 18832.28     Schwarz criterion 3.831669&lt;br /&gt;Log likelihood -13368.10     Hannan-Quinn criter. 3.831025&lt;br /&gt;Durbin-Watson stat 1.952899&lt;br /&gt;&lt;br /&gt; 2.    ls us_house_of_represen c pre2006_grades&lt;br /&gt;&lt;br /&gt;Dependent Variable: US_HOUSE_OF_REPRESEN&lt;br /&gt;Method: Least Squares&lt;br /&gt;Sample: 1 50000 IF STRATTELEMETRY_SCORE&gt;0&lt;br /&gt;Included observations: 3696&lt;br /&gt;&lt;br /&gt; Coefficient Std. Error t-Statistic Prob.&lt;br /&gt; C 0.930433 0.100374 9.269688 0.0000&lt;br /&gt;PRE2006_GRADES 0.781945 0.028890 27.06640 0.0000&lt;br /&gt;  &lt;br /&gt;R-squared 0.165498     Mean dependent var 3.565476&lt;br /&gt;Adjusted R-squared 0.165272     S.D. dependent var 1.625753&lt;br /&gt;S.E. of regression 1.485344     Akaike info criterion 3.629711&lt;br /&gt;Sum squared resid 8149.880     Schwarz criterion 3.633074&lt;br /&gt;Log likelihood -6705.707     Hannan-Quinn criter. 3.630908&lt;br /&gt;F-statistic 732.5898     Durbin-Watson stat 2.136515&lt;br /&gt;Prob(F-statistic) 0.000000&lt;br /&gt;  &lt;br /&gt;3. ls us_house_of_represen @expand(sex) age strattelemetry_score&lt;br /&gt;&lt;br /&gt;Dependent Variable: US_HOUSE_OF_REPRESEN&lt;br /&gt;Method: Least Squares&lt;br /&gt;Sample: 1 50000 IF STRATTELEMETRY_SCORE&gt;0&lt;br /&gt;Included observations: 6974&lt;br /&gt;&lt;br /&gt;Coefficient Std. Error t-Statistic Prob.&lt;br /&gt;     AGE 0.007527 0.001264 5.954138 0.0000&lt;br /&gt;STRATTELEMETRY_SCORE 0.024752 0.003550 6.973313 0.0000&lt;br /&gt;SEX=0 1.870616 0.186871 10.01023 0.0000&lt;br /&gt;SEX=1 1.635649 0.183821 8.898059 0.0000&lt;br /&gt;&lt;br /&gt; R-squared 0.018417     Mean dependent var 3.359048&lt;br /&gt;Adjusted R-squared 0.017995     S.D. dependent var 1.642550&lt;br /&gt;S.E. of regression 1.627704     Akaike info criterion 3.812791&lt;br /&gt;Sum squared resid 18466.46     Schwarz criterion 3.816720&lt;br /&gt;Log likelihood -13291.20     Hannan-Quinn criter. 3.814145&lt;br /&gt;Durbin-Watson stat 1.974734&lt;br /&gt;&lt;br /&gt;4. LS GENERAL06 C GENERAL05 GENERAL04  GENERAL03 GENERAL02 GENERAL01 GENERAL00 PRIMARY06 PRIMARY05 PRIMARY04 PRIMARY03 PRIMARY02 PRIMARY01 PRIMARY00&lt;br /&gt;&lt;br /&gt;Dependent Variable: GENERAL06&lt;br /&gt;Method: Least Squares&lt;br /&gt;Sample: 1 50000 IF STRATTELEMETRY_SCORE&gt;0&lt;br /&gt;Included observations: 45097&lt;br /&gt;&lt;br /&gt; Coefficient Std. Error t-Statistic Prob.&lt;br /&gt; C 0.147413 0.004184 35.23009 0.0000&lt;br /&gt;GENERAL05 0.283711 0.004473 63.43001 0.0000&lt;br /&gt;GENERAL04 0.206254 0.005198 39.68120 0.0000&lt;br /&gt;GENERAL03 0.014962 0.005228 2.861771 0.0042&lt;br /&gt;GENERAL02 0.108505 0.005184 20.93248 0.0000&lt;br /&gt;GENERAL01 0.023894 0.005332 4.481124 0.0000&lt;br /&gt;GENERAL00 0.042168 0.004811 8.765151 0.0000&lt;br /&gt;PRIMARY06 0.191973 0.004664 41.15622 0.0000&lt;br /&gt;PRIMARY05 0.022330 0.004948 4.512808 0.0000&lt;br /&gt;PRIMARY04 0.066253 0.004644 14.26506 0.0000&lt;br /&gt;PRIMARY03 -0.018960 0.005473 -3.464221 0.0005&lt;br /&gt;PRIMARY02 -0.036906 0.005503 -6.705928 0.0000&lt;br /&gt;PRIMARY01 -0.032836 0.005603 -5.860115 0.0000&lt;br /&gt;PRIMARY00 0.001494 0.005162 0.289372 0.7723&lt;br /&gt;&lt;br /&gt; R-squared 0.422976     Mean dependent var 0.656163&lt;br /&gt;Adjusted R-squared 0.422810     S.D. dependent var 0.474993&lt;br /&gt;S.E. of regression 0.360866     Akaike info criterion 0.799692&lt;br /&gt;Sum squared resid 5870.913     Schwarz criterion 0.802398&lt;br /&gt;Log likelihood -18017.86     Hannan-Quinn criter. 0.800544&lt;br /&gt;F-statistic 2542.093     Durbin-Watson stat 1.961476&lt;br /&gt;Prob(F-statistic) 0.000000&lt;br /&gt;&lt;br /&gt; 5. smpl if strattelemetry_score&gt;0 and sex=0 ls us_house_of_represen @expand(age)&lt;br /&gt;&lt;br /&gt;Dependent Variable: US_HOUSE_OF_REPRESEN&lt;br /&gt;Method: Least Squares&lt;br /&gt;Sample: 1 50000 IF STRATTELEMETRY_SCORE&gt;0 AND SEX=0&lt;br /&gt;Included observations: 3865&lt;br /&gt;&lt;br /&gt; Coefficient Std. Error t-Statistic Prob.&lt;br /&gt; AGE=19 3.000000 1.609292 1.864174 0.0624&lt;br /&gt;AGE=20 3.000000 0.608255 4.932141 0.0000&lt;br /&gt;AGE=21 3.181818 0.280142 11.35789 0.0000&lt;br /&gt;AGE=22 3.666667 0.280142 13.08861 0.0000&lt;br /&gt;AGE=23 3.060606 0.280142 10.92521 0.0000&lt;br /&gt;AGE=24 3.000000 0.369197 8.125747 0.0000&lt;br /&gt;AGE=25 3.592593 0.309708 11.59992 0.0000&lt;br /&gt;AGE=26 3.800000 0.321858 11.80644 0.0000&lt;br /&gt;AGE=27 3.782609 0.335561 11.27251 0.0000&lt;br /&gt;AGE=28 3.272727 0.343102 9.538638 0.0000&lt;br /&gt;AGE=29 3.533333 0.293815 12.02570 0.0000&lt;br /&gt;AGE=30 3.230769 0.315608 10.23665 0.0000&lt;br /&gt;AGE=31 2.925926 0.309708 9.447359 0.0000&lt;br /&gt;AGE=32 3.500000 0.284485 12.30292 0.0000&lt;br /&gt;AGE=33 3.450000 0.254451 13.55858 0.0000&lt;br /&gt;AGE=34 2.875000 0.232281 12.37724 0.0000&lt;br /&gt;AGE=35 3.313725 0.225346 14.70506 0.0000&lt;br /&gt;AGE=36 3.254545 0.216997 14.99812 0.0000&lt;br /&gt;AGE=37 3.028986 0.193736 15.63461 0.0000&lt;br /&gt;AGE=38 3.250000 0.179924 18.06315 0.0000&lt;br /&gt;AGE=39 3.461538 0.182216 18.99685 0.0000&lt;br /&gt;AGE=40 3.465753 0.188353 18.40028 0.0000&lt;br /&gt;AGE=41 3.202247 0.170585 18.77220 0.0000&lt;br /&gt;AGE=42 3.393939 0.198090 17.13331 0.0000&lt;br /&gt;AGE=43 3.584270 0.170585 21.01169 0.0000&lt;br /&gt;AGE=44 3.224490 0.162563 19.83532 0.0000&lt;br /&gt;AGE=45 3.524272 0.158568 22.22559 0.0000&lt;br /&gt;AGE=46 3.377358 0.156308 21.60704 0.0000&lt;br /&gt;AGE=47 3.301887 0.156308 21.12420 0.0000&lt;br /&gt;AGE=48 3.291262 0.158568 20.75613 0.0000&lt;br /&gt;AGE=49 3.240741 0.154854 20.92770 0.0000&lt;br /&gt;AGE=50 3.385965 0.150724 22.46467 0.0000&lt;br /&gt;AGE=51 3.591304 0.150067 23.93130 0.0000&lt;br /&gt;AGE=52 3.261682 0.155576 20.96519 0.0000&lt;br /&gt;AGE=53 3.344828 0.172534 19.38646 0.0000&lt;br /&gt;AGE=54 3.471698 0.156308 22.21059 0.0000&lt;br /&gt;AGE=55 3.741573 0.170585 21.93383 0.0000&lt;br /&gt;AGE=56 3.543478 0.167780 21.11976 0.0000&lt;br /&gt;AGE=57 3.386364 0.171551 19.73968 0.0000&lt;br /&gt;AGE=58 3.705882 0.174552 21.23080 0.0000&lt;br /&gt;AGE=59 3.826667 0.185825 20.59285 0.0000&lt;br /&gt;AGE=60 3.694737 0.165110 22.37745 0.0000&lt;br /&gt;AGE=61 3.333333 0.175588 18.98382 0.0000&lt;br /&gt;AGE=62 3.371429 0.192347 17.52783 0.0000&lt;br /&gt;AGE=63 3.774194 0.204380 18.46653 0.0000&lt;br /&gt;AGE=64 3.794521 0.188353 20.14576 0.0000&lt;br /&gt;AGE=65 3.615385 0.199608 18.11242 0.0000&lt;br /&gt;AGE=66 4.050000 0.254451 15.91660 0.0000&lt;br /&gt;AGE=67 3.500000 0.254451 13.75508 0.0000&lt;br /&gt;AGE=68 3.765957 0.234739 16.04315 0.0000&lt;br /&gt;AGE=69 3.545455 0.242610 14.61381 0.0000&lt;br /&gt;AGE=70 3.800000 0.254451 14.93409 0.0000&lt;br /&gt;AGE=71 3.163265 0.229899 13.75938 0.0000&lt;br /&gt;AGE=72 4.000000 0.261062 15.32206 0.0000&lt;br /&gt;AGE=73 3.418605 0.245415 13.92991 0.0000&lt;br /&gt;AGE=74 3.400000 0.293815 11.57190 0.0000&lt;br /&gt;AGE=75 3.731707 0.251329 14.84789 0.0000&lt;br /&gt;AGE=76 4.100000 0.254451 16.11310 0.0000&lt;br /&gt;AGE=77 3.250000 0.328495 9.893597 0.0000&lt;br /&gt;AGE=78 3.250000 0.284485 11.42414 0.0000&lt;br /&gt;AGE=79 3.608696 0.335561 10.75423 0.0000&lt;br /&gt;AGE=80 3.842105 0.369197 10.40666 0.0000&lt;br /&gt;AGE=81 3.516129 0.289037 12.16497 0.0000&lt;br /&gt;AGE=82 4.230769 0.315608 13.40514 0.0000&lt;br /&gt;AGE=83 4.130435 0.335561 12.30906 0.0000&lt;br /&gt;AGE=84 3.750000 0.328495 11.41569 0.0000&lt;br /&gt;AGE=85 3.666667 0.328495 11.16201 0.0000&lt;br /&gt;AGE=86 4.066667 0.415517 9.786997 0.0000&lt;br /&gt;AGE=87 3.400000 0.359849 9.448419 0.0000&lt;br /&gt;AGE=88 4.750000 0.568971 8.348411 0.0000&lt;br /&gt;AGE=89 3.666667 0.656991 5.581003 0.0000&lt;br /&gt;AGE=90 4.000000 0.568971 7.030241 0.0000&lt;br /&gt;AGE=91 4.600000 0.719697 6.391578 0.0000&lt;br /&gt;AGE=92 3.666667 0.656991 5.581003 0.0000&lt;br /&gt;AGE=93 3.400000 0.719697 4.724210 0.0000&lt;br /&gt;AGE=94 4.000000 1.137941 3.515121 0.0004&lt;br /&gt;AGE=95 3.000000 1.609292 1.864174 0.0624&lt;br /&gt;AGE=96 1.000000 1.609292 0.621391 0.5344&lt;br /&gt;AGE=97 5.000000 1.609292 3.106957 0.0019&lt;br /&gt;AGE=98 5.000000 1.609292 3.106957 0.0019&lt;br /&gt;AGE=99 5.000000 1.609292 3.106957 0.0019&lt;br /&gt;&lt;br /&gt;R-squared 0.027811     Mean dependent var 3.477102&lt;br /&gt;Adjusted R-squared 0.007258     S.D. dependent var 1.615164&lt;br /&gt;S.E. of regression 1.609292     Akaike info criterion 3.810200&lt;br /&gt;Sum squared resid 9799.878     Schwarz criterion 3.941387&lt;br /&gt;Log likelihood -7282.211     Hannan-Quinn criter. 3.856783&lt;br /&gt;Durbin-Watson stat 1.903265&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7444616934468721198-2035154773964653910?l=rainriversea.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://rainriversea.blogspot.com/feeds/2035154773964653910/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7444616934468721198&amp;postID=2035154773964653910&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7444616934468721198/posts/default/2035154773964653910'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7444616934468721198/posts/default/2035154773964653910'/><link rel='alternate' type='text/html' href='http://rainriversea.blogspot.com/2008/11/statisticians-get-no-respect.html' title='R-values 1, gut instinct 0'/><author><name>Paul Clayton</name><uri>http://www.blogger.com/profile/12852763566980520419</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://3.bp.blogspot.com/_FdlS-DRoyac/SSoKH9aDk0I/AAAAAAAAAAM/dwbDlKOhZ50/S220/IMG_1305.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7444616934468721198.post-1243580611505603658</id><published>2008-11-26T22:43:00.000-08:00</published><updated>2008-11-26T23:05:26.038-08:00</updated><title type='text'>"Grant Hadwin got a chainsaw and did something terrible"</title><content type='html'>So begins a fascinating story of idealism and uncompromising zeal, set on the bank of the Yakoun River in British Columbia.  This is a story which leaves you wondering about the meaning of heroism, and of folly.&lt;br /&gt;&lt;br /&gt;I'm not sure if the protagonist is Grant Hadwin or the singular golden spruce.&lt;br /&gt;&lt;br /&gt;I recently carved a kayak paddle out of Sitka Spruce; I have a different feel for this tree now that I've felt how tough spruce fibers can be.  I highly recommend this story and, thanks to the internet, it's available for &lt;a href="http://books.google.com/books?id=7TctFZbIWaMC&amp;amp;pg=PAo&amp;amp;dq=THE+GOLDEN+BOUGH,+by+JOHN+VAILLANT&amp;amp;ei=K0QuSfp7j_aUBLbJ9P4D#PPP1,M1"&gt;preview on Google Books&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://books.google.com/books?id=7TctFZbIWaMC&amp;amp;pg=PAo&amp;amp;dq=THE+GOLDEN+BOUGH,+by+JOHN+VAILLANT&amp;amp;ei=K0QuSfp7j_aUBLbJ9P4D#PPP1,M1"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 216px; height: 327px;" src="http://www.cbc.ca/arts/images/pics/goldenspruce2.jpg" alt="" border="0" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7444616934468721198-1243580611505603658?l=rainriversea.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://rainriversea.blogspot.com/feeds/1243580611505603658/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7444616934468721198&amp;postID=1243580611505603658&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7444616934468721198/posts/default/1243580611505603658'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7444616934468721198/posts/default/1243580611505603658'/><link rel='alternate' type='text/html' href='http://rainriversea.blogspot.com/2008/11/grant-hadwin-got-chainsaw-and-did.html' title='&quot;Grant Hadwin got a chainsaw and did something terrible&quot;'/><author><name>Paul Clayton</name><uri>http://www.blogger.com/profile/12852763566980520419</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://3.bp.blogspot.com/_FdlS-DRoyac/SSoKH9aDk0I/AAAAAAAAAAM/dwbDlKOhZ50/S220/IMG_1305.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7444616934468721198.post-1389422335233321893</id><published>2008-11-26T21:49:00.000-08:00</published><updated>2008-11-26T22:37:49.819-08:00</updated><title type='text'>Real time home energy monitoring</title><content type='html'>Imagine a gas pump that didn't tell you the price of the gas you'd just pumped.  In this country, I imagine that would cause a near-riot.&lt;br /&gt;&lt;br /&gt;Wait, that sounds like my electricity meter!  After an $800 utility bill, I became intimately familiar with that meter.&lt;br /&gt;&lt;br /&gt;So, in 2005 (and in shock), I put together a simple database to track my own electricity usage.  The variation in my daily and hourly usage was enormous.  Even modest changes in weather had a big impact.  My house used electric radiant heat, and we must have been &lt;b&gt;very&lt;/b&gt; popular down at Seattle City Light.&lt;br /&gt;&lt;br /&gt;Having access to real-time feedback enabled me to reduce my energy use noticeably.  Sometimes, information can be as powerful a tool as extra weather stripping.  Commercial versions of real-time energy monitors are available, though fairly expensive at the retail level.  I have not seen any examples of utilities promoting or subsidizing energy information systems.  I feel this could be a wise and cost effective option to pursue for utilities and governmental agencies.&lt;br /&gt;&lt;br /&gt;Here's a graph of my daily electricity usage, put into terms we can all understand:  $$$$$$$&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_FdlS-DRoyac/SS46Hv2RW_I/AAAAAAAAAA4/h0L8WTdrpPI/s1600-h/KW2.jpg"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 400px; height: 213px;" src="http://4.bp.blogspot.com/_FdlS-DRoyac/SS46Hv2RW_I/AAAAAAAAAA4/h0L8WTdrpPI/s400/KW2.jpg" alt="" id="BLOGGER_PHOTO_ID_5273216118475021298" border="0" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7444616934468721198-1389422335233321893?l=rainriversea.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://rainriversea.blogspot.com/feeds/1389422335233321893/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7444616934468721198&amp;postID=1389422335233321893&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7444616934468721198/posts/default/1389422335233321893'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7444616934468721198/posts/default/1389422335233321893'/><link rel='alternate' type='text/html' href='http://rainriversea.blogspot.com/2008/11/real-time-home-energy-monitoring.html' title='Real time home energy monitoring'/><author><name>Paul Clayton</name><uri>http://www.blogger.com/profile/12852763566980520419</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://3.bp.blogspot.com/_FdlS-DRoyac/SSoKH9aDk0I/AAAAAAAAAAM/dwbDlKOhZ50/S220/IMG_1305.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_FdlS-DRoyac/SS46Hv2RW_I/AAAAAAAAAA4/h0L8WTdrpPI/s72-c/KW2.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7444616934468721198.post-7111843654245137693</id><published>2008-11-23T17:59:00.000-08:00</published><updated>2009-02-23T16:22:58.917-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='resume'/><title type='text'>Abbreviated resume</title><content type='html'>&lt;span style="color: rgb(0, 0, 0); font-weight: bold;"&gt;Technical Skills&lt;/span&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;SQL, MySQL, MS Access, Excel, HTML, CSS, ASP, Visual Basic, Photoshop, Javascript&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;span style="color: rgb(0, 0, 0); font-weight: bold;"&gt;Website design&lt;/span&gt;&lt;br /&gt;&lt;ul style="color: rgb(102, 102, 102);"&gt;&lt;li&gt;&lt;a href="http://www.fisheyeguyphotography.com"&gt;Fish Eye Guy Photography&lt;/a&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.denniskirkland.com/"&gt;Dennis Kirkland Photography&lt;/a&gt;&lt;span style="text-decoration: underline;"&gt;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style="text-decoration: underline;"&gt;&lt;a href="http://www.planetorganics.com/"&gt;Planet Organics&lt;/a&gt;&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;Summary:  I enjoy databases.  I am highly skilled in database architecture, including creation, modification and restoration.  I am well versed in ETL procedures utilizing various data formats, as well as setup, optimization and maintenance of indexes.  I enjoy solid, meticulous, and optimized query writing.  My university studies included: statistics, econometrics, advanced calculus, and game theory.&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(0, 0, 0); font-weight: bold;"&gt;Education&lt;/span&gt;&lt;br /&gt;Bachelor of Science, Economics, University of Washington&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(0, 0, 0); font-weight: bold;"&gt;Employment&lt;/span&gt;&lt;br /&gt;&lt;span style="color: rgb(0, 0, 0);"&gt;Alaska Salmon Program, University of WA - Research Consultant&lt;/span&gt;&lt;br /&gt;02/ 2007 – present&lt;br /&gt;I currently work for a grant-funded university research department with many data contributors.  My position has three primary components.  First, I created a database to capture long term environmental, biological, and fisheries data.  Second, I provide ongoing technical assistance to several teams of field researchers and data support for scientific papers published by the program.  This has included onsite fieldwork in Alaska.  Finally, I hire, train, and supervise a team of data entry personnel.&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(0, 0, 0);"&gt;Washington State Democratic Central Committee - Data Analyst&lt;/span&gt;        &lt;br /&gt;08/2008 – present&lt;br /&gt;Based on my successful work in 2006, I was contacted to work on a contract basis for the 2008 election cycle.  I provide data analysis for the WSDCC in support of local, state, and national electoral contests.&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(0, 0, 0);"&gt;Washington State Democratic Coordinated Campaign - Database Manager&lt;/span&gt;&lt;br /&gt;07/2006 – 12/ 2006&lt;br /&gt;I managed the Washington State Democratic Party voter-file database and provided technical support for Senatorial and Congressional campaigns.  Key objectives for this position were ensuring data integrity and providing statistical analysis.  I optimized the use of existing data using statistical analysis and I was able to significantly reduce statewide campaign spending by eliminating errant or redundant data and by improving data capture methods.  I independently researched and directed successful micro-targeting analysis using demographic modeling.  In this role, I provided management oversight of software vendors, support staff, and volunteers.&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(0, 0, 0);"&gt;Western Wireless Inc. - Database Reporter&lt;/span&gt;&lt;br /&gt;08/ 2005 – 05/2006&lt;br /&gt;In this contract position, I developed sales and marketing reports querying various Oracle and SQL databases.  Reports were primarily pivot-table report/chart presentation in Excel.  I also reverse engineered and debugged pre-existing, undocumented work by prior programmers.&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(0, 0, 0);"&gt;Planet Organics - Database and Website Developer&lt;/span&gt;                 &lt;br /&gt;04/2001 – 08/2005&lt;br /&gt;For this small but rapidly expanding business, I served as the jack-of-all trades for all computer needs from hardware installation to sophisticated analytics.  I managed the design and development of databases in SQL, Great Plains, and MS Access.  I also maintained data integrity between these databases and established back-up procedures.  I oversaw the website and customer technical assistance.  I coded, tested, debugged, and documented new applications.  I also provided technical training to all staff.  I administered security measures to restrict unauthorized use of data systems and databases, managed email lists, and supervised the quality of the data entry process.  Finally, I tracked website metrics and used analysis of metrics to improve customer acquisition and retention, as well as to promote increased revenue.  I consistently achieved #1 natural rankings on all major search engines for this business.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7444616934468721198-7111843654245137693?l=rainriversea.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://rainriversea.blogspot.com/feeds/7111843654245137693/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7444616934468721198&amp;postID=7111843654245137693&amp;isPopup=true' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7444616934468721198/posts/default/7111843654245137693'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7444616934468721198/posts/default/7111843654245137693'/><link rel='alternate' type='text/html' href='http://rainriversea.blogspot.com/2008/11/abbreviated-resume.html' title='Abbreviated resume'/><author><name>Paul Clayton</name><uri>http://www.blogger.com/profile/12852763566980520419</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://3.bp.blogspot.com/_FdlS-DRoyac/SSoKH9aDk0I/AAAAAAAAAAM/dwbDlKOhZ50/S220/IMG_1305.jpg'/></author><thr:total>0</thr:total></entry></feed>
