"My own statistics prof said..."
I am sure we sure we are more than capable of looking beyond the scope of what your statistics professor had time to teach you at university. I have some knowledge and education of statistics myself, not that it makes me particularly more entitled to comment about it.
"Thats not the skill that's taught in a statistics degree."
I commend you for apparently having a statistics degree of some form. To suggest that analysing and comprehending large amounts of data isnt taught in a statistics degree makes me question your statistics degree. I'm not saying your degree is any better worse, perhaps just unique. Of course, comprehending large amounts statistical data would lead to the use of algorithms to accurately explain the data. We rely on algorithms and mathematics for statistical analysis. Understanding the 'complicated' maths or Bayes theorem wouldnt seem like that great a stretch given the OP's education which is my initial point.
I have studied bioinformatics and as such I have a particular idea of the domain of medical statistics and the domain of bioinformatics.
Big data often means that testing for 5% significance is a bad idea. As a result people working on big biological data weren't very welcome by the frequentists in medical statistics and bioinformatics formed it's own community.
That community split produces effects such as bioinformatics having it's own server for R packages and not using the server in which the statistics folks put their R packages.
In another post in thi...
Imagine you had the following at your disposal:
Imagine that your goal were to slow or prevent biological aging...
Thanks for your input.
Update
I thank everyone for their input and apologize for how long it has taken me to post an update.
I met with Aubrey de Grey and he recommended using the anonymized patient data to look for novel uses for already-prescribed drugs. He also suggested I do a comparison of existing longitudinal studies (e.g. Framingham) and the equivalent data elements from our data warehouse. I asked him that if he runs into any researchers with promising theories or methods but for a massive human dataset to test them on, to send them my way.
My original question was a bit to broad in retrospect: I should have focused more on how to best leverage the capabilities my project already has in place rather than a more general "what should I do with myself" kind of appeal. On the other hand, at the time I might have been less confident about the project's success than I am now. Though the conversation immediately went off into prospective experiments rather than analyzing existing data, there were some great ideas there that may yet become practical to implement.
At any rate, a lot of this has been overcome by events. In the last six months I realized that before we even get to the bifurcation point between longevity and other research areas, there are a crapload of technical, logistical, and organizational problems to solve. I no longer have any doubt that these real problems are worth solving, my team is well positioned to solve many of them, and the solutions will significantly accelerate research in many areas including longevity. We have institutional support, we have a credible revenue stream, and no shortage of promising directions to pursue. The limiting factor now is people-hours. So, we are recruiting.
Thanks again to everyone for their feedback.