Forums before death by AOL, social media and spammers... "We can't have nice things"
|    alt.ufo.reports    |    The latest from planet crackpot    |    8,965 messages    |
[   << oldest   |   < older   |   list   |   newer >   |   newest >>   ]
|    Message 8,499 of 8,965    |
|    Kym Horsell to All    |
|    Why do some states report more UFOs? (p1    |
|    02 May 23 18:57:28    |
      From: kymhorsell@gmail.com              EXECUTIVE SUMMARY:       - We run a new s/w over UFO sightings of different types and see which        state demographics predict local UFO activity.       - Some factors occur in several UFO models. It seems state temperature        and distance from the sea figure in several key "types".       - Some key factors may relate to the behaviour of UFO witnesses. But        almost all seem to relate "mostly" to the reported behaviour and/or        imputed interests of UFOs. We can reason in some cases a factor can        not reasonably be associated with state residents in the way or to        the extent the stats procedure claims it does.       - Finally, we look at a couple of key UFO types and note some        interesting and some unpleasant associates they may have. In many        cases the attributes found by the s/w using only relatively        un-imaginative data science techniques and a wide range of data are        the same suggested by many UFO witnesses and/or researchers over a        number of decades. Seemingly, no matter how crazy some of the claims        may sound, some of them seem to have some backup in the hard data.              The AI's have been chattering away and come up with some new takes on       the patterns of UFO sightings across N America. They've had some new       tools to play with in the past few months and this report is based on       state by state "lasso regression" that validates the model found.              A lasso regression involves not only trying to find the "best fit" of       a line across a dataset to follow a given target variable (in our       case, here, some kind of UFO count state by state) but it also tries       to make the resulting model "as simple as possible" by also including       a special "regularisation term" to the fitting value to be       optimised. Normally this means as many coefficient as possible in the       model get set to 0 and only the "key variables" are left to explain       the target data as well as they can.              If that wasn't enough of a thing to grasp in one sitting, the       "validation" part gets added on. It turns out just statistical tests       are not always the best way to establish a model is "good". It may       appear to fit the data very well (typically meaning the so-called       "explanation power" statistic -- the R2 -- is reasonably close to 1)       yet have no predictive power at all. A new set of data -- despite the       claims of statistical tests -- will fail to be well-estimated by the       model usually because "something changed" between the data you had       when the model was created and the time you got the new data to use to       make a new estimate. To better ensure a stat model will work with new       data at some time later we have to at least run a test for that one       time. And the way most stats-heads and s/w packages do it is to hold       some of the data aside when building the model to test whether it       actually explains the "hidden data" as well as the data used to build       the model.              So in the procedure used here the model is created using just a random       1/2 of the data to hand, then it is tested against the other 1/2 of       the data and if both halves are reasonably-well explained by the model       then it is accepted as have not only statistical robustness, but       predictive power.              So these are the models we will now look at. Using all the state       demographics and other data to hand we will try to build robust and       predictive models for UFO activity of various types over each state.       In the UFO dataset I normally use -- the set from the NUFORC -- the       types of sighting are normally broken down by "shape" of the object or       objects seen. The NUFORC have a long-established set of shapes and I       normally follow those fairly closely. Some new shapes tend to get       added over time, following "fashion sense" among witnesses. E.g. a       "tictac" was added at some point. But I also have chosen to sometimes       add yet other shapes my own fashion sense dictates. E.g. I look in       the comment section of reports for certain keywords like "faint",       "bright", "cube", or "pyramid", among others.              And so we will look here at how a series of validated lasso       regressions explained sightings of a collection of UFO types across N       America. Viz:        pink        purple        Egg        Disk        Oval        Circle        Sphere        bright        faint        gold        red        yellow        black        Light        notLight        Triangle              In general those keywords starting with a capital are NUFORC's chosen       shape names while the lower-case words are ones I've chosen for one       reason or another to locate in the comment section of reports. They       are typically colours but also can indicate other things like "bright"       and "faint". Most of these can be applied to Lights type UFO's. But       they also may apply to lights seen *on* some other kind of UFO.       Typically a witness reports whether there were lights on the bottom of       some otherwise dark object, or sometimes they see what they think are       "port-holes" or "domes" on object through which some coloured light is       visible as well. And, finally, the notLight category is all of those       UFO's not in the Light category. It make sense to split up what people       typically see as "little lights going across the sky at night" and       everything else they might see typically in the daytime and oftentimes       reasonably up close and personal.              There are of course many other shapes and colours. But these are the       boundaries we're using today. :)              I currently have 915 state-by-state statistics. These cover a very       wide range of things including social factors, economy, environment,       weather, medical, law, crime and many other things. All of these are       thrown into the mix and all the above UFO types are cross-checked       against all 915 factors and the best ones "lassoed" and validated.              We then can check to see whether some factors are common across all       the selected UFO types. And there are a few. Ignoring the factors       that only apply to one type we find the common factors are:              Variable Number of UFO types with this var as a relevant factor        in NUFORC sightings since ~1950       coffeerank 5       maxtemp 5       murder2019pc 5       coastdist 4       diversity 3       infgasgatherint 3       medage 3       birthrate 2       carspc 2       casespc20210408 2       cattledeathspredpct2015 2       coalconsint 2       coviddailycasespc20211121 2       dope 2       evpct 2       lung 2       maskapr2021 2       minpre 2       missingdoenetworkpc 2       pctgenrenew 2       pctnonwhite 2       planeaccidents-personal 2       pre6 2       suiciderate 2       totdeathspc14feb2021 2       under18 2       usoutagesvandalismpc 2       wind 2              Hopefully most of these things are self-explanatory. Some factors are       related to conditions in the US at a certain time. Some are factors              [continued in next message]              --- SoupGate-Win32 v1.05        * Origin: you cannot sedate... all the things you hate (1:229/2)    |
[   << oldest   |   < older   |   list   |   newer >   |   newest >>   ]
(c) 1994, bbs@darkrealms.ca