Comment author: gjm 22 August 2016 10:13:27PM -1 points [-]

Wait, your category (ii) is surely exactly what we care about here. We want to know: For someone whose background would lead you not to expect high-paying data science jobs, is Signal effective in getting them a better chance of a high-paying data science job?

Comment author: JonahSinick 23 August 2016 12:30:06AM *  1 point [-]

Wait, your category (ii) is surely exactly what we care about here.

Yes, I see how my last message was ambiguous.

What I had in mind in bringing up category (ii) is that we've had some students who had a priori worse near term employment prospects relative to the usual range of bootcamp attendees, who are better positions than they had been and who got what they were looking to get from the program, while not yet having $100k+ paying jobs. And most students who would have gotten $100k+ paying jobs even if they hadn't attended appear to have benefited from attending the program.

The nature of the value that we have to add is very much specific to the student.

Comment author: ThisSpaceAvailable 21 August 2016 01:31:02AM *  4 points [-]

I suppose this might be better place to ask than trying to resurrect a previous thread:

What kind of statistics can Signal offer on prior cohorts? E.g. percentage with jobs, percentage with jobs in data science field, percentage with incomes over $100k, median income of graduates, mean income of graduates, mean income of employed graduates, etc.? And how do the different cohorts compare? (Those are just examples; I don't necessarily expect to get those exact answers, but it would be good to have some data and have it be presented in a manner that is at least partially resistant to cherry picking/massaging, etc.) Basically, what sort of evidence E does Signal have to offer, such that I should update towards it being effective, given both E, and "E has been selected by Signal, and Signal has an interest in choosing E to be as flattering rather than as informative as possible" are true?

Also, the last I heard, there was a deposit requirement. What's the refund policy on that?

Comment author: JonahSinick 22 August 2016 09:33:02PM 1 point [-]

Hello! I'm a cofounder of Signal Data Science.

Because our students have come into the program from very heterogeneous backgrounds (ranging from high school dropout to math PhD with years of experience as a software engineer), summary statistics along the lines that you're looking for are less informative than might seem to be the case prima facie. In particular, we don't yet have meaningfully large sample of students who don't fall into one of the categories of (i) people who would have gotten high paying jobs anyway and (ii) people who one wouldn't expect to have gotten high paying jobs by now, based on their backgrounds.

If you're interested in the possibility of attending the program, we encourage you to fill out our short application form. If it seems like it might be a good fit for you, we'd be happy to provide detailed answers to any questions that you might have about job placement.

Comment author: ThisSpaceAvailable 16 June 2016 02:21:42AM 0 points [-]

"We're planning another one in Berkeley from May 2nd – July 24th."

Is that June 24th?

Comment author: JonahSinick 17 June 2016 10:48:35PM 0 points [-]

Yes, that was supposed to be June 24th! We have a third one from July 5th – August 24th. There are still spaces in the program if you're interested in attending.

Comment author: Fluttershy 12 April 2016 02:22:53AM 1 point [-]

I've already had versions of this conversation with Robert and Jonah in person, but I'll reiterate a few things I shared with them here, since you asked politely. Also, this conversation is becoming aversive to me, so it will become increasingly difficult for me to respond to your comments as we get farther and farther down this comment chain.

specific examples of times when Jonah's explanations were too abstract and not sufficiently practical?

There were actually multiple times during the first couple weeks when I (or my partner and I) would spend 4+ hours trying to fix one particular line of code, and Jonah would give big-picture answers about e.g. how linear regression worked in theory, when what I'd asked for were specific suggestions on how to fix that line of code. This led me to giving up on asking Jonah for help after long enough.

what are some specific topics that you think were neglected in favor of more abstract but less applicable material?

Intermediate and advanced SQL, practice of certain social skills (e.g. handshakes, being interested in your interviewer, and other interview-relevant social skills), and possibly nonlinear models.

Comment author: JonahSinick 12 April 2016 09:07:44AM 3 points [-]

Thanks for the written feedback (which adds to what I had gleaned in person).

There were actually multiple times during the first couple weeks when I (or my partner and I) would spend 4+ hours trying to fix one particular line of code, and Jonah would give big-picture answers about e.g. how linear regression worked in theory, when what I'd asked for were specific suggestions on how to fix that line of code. This led me to giving up on asking Jonah for help after long enough.

I think that what happened here is me having misunderstood what you were asking for, rather than any disinclination on my part to help you with individual lines of code. I will take this feedback into account.

Intermediate and advanced SQL, practice of certain social skills (e.g. handshakes, being interested in your interviewer, and other interview-relevant social skills), and possibly nonlinear models.

This is helpful detail regarding what you were looking for. Which topics would you have preferred to have been been dropped in favor of these?

Comment author: Toggle 11 April 2016 05:53:32PM 1 point [-]

How many students have found work in data science (so far), what problems are they solving now, and what are the associated companies/cities/salaries?

Comment author: JonahSinick 12 April 2016 01:26:52AM *  1 point [-]

Hi Toggle,

Thanks for your question!

Most of our students have just started looking for jobs over the past ~2 weeks, and the job search process in the tech sector typically takes ~2 months, from sending out resumes to accepting offers (see, e.g. "Managing your time" in Alexei's post Maximizing Your Donations via a Job).

The feedback loop here is correspondingly longer than we'd like. We expect to have an answer to your question by the time we advertise our third cohort.

An update on Signal Data Science (an intensive data science training program)

5 JonahSinick 09 April 2016 05:02AM

In December 2015, Robert Cordwell and I cofounded Signal Data Science (website), which we announced on Less Wrong.

Our first cohort has just concluded, and overall went very well. We're planning another one in Berkeley from May 2nd – June 24th. The program is a good fit for people who are both excited to learn how to extract insights from data sets and looking to prepare for industry data science jobs. If you're interested attending the next cohort, we would love to hear from you. You can apply here, or contact us at signaldatascience@gmail.com.   

We offer inquiry-based learning and an unusually intellectually curious peer group. Unlike typical college classes, Signal Data Science focuses on learning by doing. You’ll learn from a combination of lectures, short knowledge-reinforcement problems, and longer, more open-ended assignments focusing on analyzing real datasets. (That’s your chance to discover something new!) Don’t worry if that sounds daunting: our instructors will be there to support you every step of the way.

You’ll learn both the theory and the application of a wide array of data science techniques. We offer a pair programming-focused curriculum, allowing students to learn from each other’s strengths. We cover everything from basic linear regression to advanced, industry-relevant methods like support vector machines and dimensionality reduction. You’ll do an advanced, self-directed project at the end of the course. Curious? Check out our showcase of past students’ final projects. Whatever your interests are—from doing something with real-world, industry-relevant applicability to applying cutting-edge neural nets—we’ll work with you to find a project to match your interests and help you showcase it to prospective employers.

Less Wrong readers might be especially interested by Olivia Schaefer's project, which describes results of doing some natural language processing on the Less Wrong comment corpus, explaining how the words pictured in different colors below are at opposite ends of an axis.

 

Comment author: Fluttershy 19 December 2015 12:56:38PM 1 point [-]

Neat! Here are the first questions I have:

  • Do you require applicants to have a graduate degree?
  • Zipfian Academy, App Academy, and other bootcamps are 12 weeks long, and (the first instance of) this one is only 6 weeks long. Why is this, and what are you cutting out relative to other data science bootcamps to make it this short? (This is my most pressing question).
  • As a tie-in to my last question, is there a hiring event which employers will be invited to around the end of the program?
  • Do you know which language(s) you'll be using?

Good luck; do keep us posted.

Comment author: JonahSinick 20 December 2015 01:34:58AM *  3 points [-]

Thanks for your interest! Some responses below.

Do you require applicants to have a graduate degree?

No degree is required. We're selecting on ability rather than on credentials.

Zipfian Academy, App Academy, and other bootcamps are 12 weeks long, and (the first instance of) this one is only 6 weeks long. Why is this, and what are you cutting out relative to other data science bootcamps to make it this short? (This is my most pressing question).

  1. Based on the preliminary interest that people have expressed anticipate that the students in our first cohort will be significantly stronger than is typical of data science bootcamps, and will correspondingly be able to cover the material at an accelerated pace. We expect at least some of our cohorts to run a full 12 weeks.

  2. Regarding the comparison with coding bootcamps, there are reasons to believe that the amount that somebody needs to know to be in the top x% of industry data scientists is less than the amount that's needed to be in the top x% of programmers. (I can elaborate.)

  3. We're cutting out some of the more advanced machine learning algorithms, which industry data scientists use infrequently enough so that they can be a distraction from getting started.

As a tie-in to my last question, is there a hiring event which employers will be invited to around the end of the program?

Very few bootcamp students who I know got their jobs through this route, so we may or may not do this depend on how efficient it is relative to other routes. Like other bootcamps that offer the "pay later" model, we have a large stake in ensuring that our students find jobs.

Do you know which language(s) you'll be using?

We'll be working primarily in R, and teaching SQL as well.

Comment author: Daniel_Burfoot 19 December 2015 04:09:22PM 1 point [-]

I hope you throw SQL into your core skills bucket list.

Comment author: JonahSinick 20 December 2015 12:58:50AM 0 points [-]

Yes, we'll definitely be covering this.

Comment author: IlyaShpitser 19 December 2015 06:00:57PM 3 points [-]

Hey Jonah, have you thought about doing some causal inference stuff in the full length course?

Comment author: JonahSinick 20 December 2015 12:58:34AM 2 points [-]

Thanks for the suggestion. That would be wonderful. We'll definitely think about this – it's a matter of whether we can create a sufficiently simple presentation of the material so that the marginal returns per unit time are high for the student population that we'll be working with.

Announcing the Signal Data Science Intensive Training Program

21 JonahSinick 19 December 2015 12:30AM

Note: We now have a website with up to date information here: http://signaldatascience.com/.


(This post is coauthored with Robert Cordwell.)

We’re writing to announce the inaugural run of Signal Data Science’s intensive training program.

The program will train students in the core skills needed to work as a professional data scientist:

  • Scraping and cleaning data
  • Exploring and analyzing data using statistics
  • Presenting findings
  • Interviewing

By the end of the course, you’ll will be able to start with raw data and produce analyses like the one in Bayesian Adjustment of Yelp Ratings. More to the point, you’ll understand why Jonah structured the analysis the way he did and be able to do the same yourself.

You’ll also be able to produce cool visualizations like this automatic grouping of Slate Star Codex posts by topic, as shown below.

Why data science?

Making inferences from data is fundamental to understanding the world, and there’s a growing unmet need in industry for people with the relevant skills. With good instruction and peer group, smart, motivated people can quickly develop enough proficiency to get jobs in the tech sector (starting compensation ~$115k in the San Francisco Bay Area).

Why us?

The Program

We offer inquiry-based learning (no boring lecturers or unmotivating problem sets!) and an unusually intellectually curious peer group. Far from what’s typical of college classes, our model has more in common with the Math Olympiad Summer Program, where daily lectures are interspersed with on-the-spot problems and followed by long-form problems designed to build on the lesson.

Robert Cordwell is an IMO gold medalist and educational startup veteran who’s working a Facebook data science job despite his limited, self-taught experience. He’s going to be teaching math problem solving, overall presentation skills, and how to break interviews.

Jonah Sinick is a data scientist with 13 years of experience making advanced math accessible to beginners, a PhD in math from University of Illinois, and an extensive body of published work. He’ll be teaching a comprehensive technical curriculum.

Who is this for?

If you:

  • Are interested in data science
  • Passionate about learning new things
  • Would benefit from a social environment with others working toward the same goal
  • Have the programming skills to solve simple algorithms problems
  • Plan on applying for data science jobs after the program

our program will be a good fit for you.

Where / When

The first cohort will run in Berkeley for 6 weeks, from Feburary 1st – March 18th. This will be a compressed version of the standard course that we’ll be offering in the future, and is targeted at students who have a high degree of comfort with math.

In the future we’ll be offering longer courses that cover the mathematical / statistical material at a gentler pace.

Cost

For students in our first 6 week cohort, we offer two options:

  • Payment of $8,000 at the start of the program.
  • A “pay later” model where students pay 8% of their first year’s salary (pretax, spaced over 6 months), contingent on getting a data science job.

This is roughly 50% of the standard price for coding /data science bootcamps.

Next steps

If you’re interested in exploring participating in our first cohort, or keeping posted, please be in touch with us at signaldatascience@gmail.com.

View more: Next