You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
These are three thoughtful and interesting proposals.
The first struck me as a bit humdrum, but I think there's actually quite a bit of clever work that you could do here. One interesting angle is would be to run a model at several different levels of nesting, where you fit a bunch of models, each time controlling for more and more things. It'd be interesting to track how the residuals for each school shift as you move from one model to the next (and what does a residual mean in this context?). There also another rich data set called the College Scorecard that you could draw upon. I think this project is the best bet.
The other two projects get into the wide world of text-as-data, which can take quite a bit of wrangling before you can get usable features out of. Of the two, the "toxic" one seems more promising if the data exists and if you're able to annotate it without too much trouble. I think this one could work.
The twitter project would be difficult I think, because you'd want to be able to make a strong causal claim about twitter use effecting company performance, but you'd just be working with observational data and there's likely a complicated web of causation that will muddy any clear conclusion.
The text was updated successfully, but these errors were encountered:
These are three thoughtful and interesting proposals.
The first struck me as a bit humdrum, but I think there's actually quite a bit of clever work that you could do here. One interesting angle is would be to run a model at several different levels of nesting, where you fit a bunch of models, each time controlling for more and more things. It'd be interesting to track how the residuals for each school shift as you move from one model to the next (and what does a residual mean in this context?). There also another rich data set called the College Scorecard that you could draw upon. I think this project is the best bet.
The other two projects get into the wide world of text-as-data, which can take quite a bit of wrangling before you can get usable features out of. Of the two, the "toxic" one seems more promising if the data exists and if you're able to annotate it without too much trouble. I think this one could work.
The twitter project would be difficult I think, because you'd want to be able to make a strong causal claim about twitter use effecting company performance, but you'd just be working with observational data and there's likely a complicated web of causation that will muddy any clear conclusion.
The text was updated successfully, but these errors were encountered: