-
Notifications
You must be signed in to change notification settings - Fork 54
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve DIBELS data, and potentially the import process #1940
Comments
Looking at
@alexsoble can you check out this issue and update this to reflect the latest status based on the awesome work you did in #1969 and #1973? |
Those lines are for incoming DIBELS rows that do not have a benchmark score that matches "CORE", "STRATEGIC", or "INTENSIVE". I sent an email to Uri and you asking if we should do anything differently for rows where Benchmark == "Benchmark". SomervilleDIBELS data is now stored in its own table, which strong validations both at the database level and the Rails level. This means no more invalid data. There are now no more "#DIV/0!" or other invalid values anywhere in the product. The tradeoff is that subtest results don't show up in the Student Profile like they used to. But subtest results are now stored in our database separately from benchmark scores, and we could easily visualize them. We could even do a "hover for subtest results" or "click for subtests" UI element for DIBELS when subtest results exist. Note that all this applies just to Somerville. New BedfordAll DIBELS data has been deleted from the New Bedford instance, since we still need to get handle on how best to import, store, and visualize New Bedford's DIBELS data. ~ @kevinrobinson shall we close this issue and make a new one that says "Import and display DIBELS data for New Bedford"? |
Update from Uri: If it says Benchmark that’s another word for Core (wonder if that’s what they use in New Bedford) |
@alexsoble awesome, thanks! 👍 I think there's more work here for Somerville too that is described in the issue, and my understanding is that there's still some bits we'd ned to do to finish those off. I'll write out what I see so I can remember later, but I think let's consider this handed-off and thanks for updating this! Somerville
Subtest scores
New Bedford
|
This is quite stale, a newer thread of work is related to #2472. |
To finish this off, we should deprecate code related to |
Currently, the
DibelsRow
class ignores subjects, and collapses everything together. New Bedford and Somerville export data in different shapes, so this impacts them differently.We may want to consider solving this differently, and simplifying the human flow of data from reading teacher > Insights, rather than routing this through IT, Aspen, export, import as well.
Somerville
For somerville,
DibelsRows
mostly makes sense, since the "subject" is redundant with the "name" and just indicates what time of the year the assessment was (eg, fall / winter / spring). We throw this away in the import, but it's encoded in theassessment_name
. for somerville, it also looks like thedate taken
might not be inaccurate - all records are set to the same handful of dates across years (eg, 2017-01-01). This could be happening on UH or JB's end but seems like it probably doesn't reflect the actual assessment date. See emails "Winter DIBELS for Upload to Aspen" and "Spring DIBELS and F&P Data for Aspen Import" for more, although these don't represent all exchanges, and the format for data exchange has changed over time. All these are generated from a "master" XLS that UH has.assessment_growth
is a collapsing of several rows in the format "NC: 28,NW: 8,O: 29", and we need to investigate if there are bugs there - spot checking Somerville student #5475, it doesn't match the "Spring DIBELS and F&P Data for Aspen Import" spreadsheet. The CSV says this is "NWFCS" - Nonsense Word Fluency-Correct Sounds, but Insights says this is "PSF" - Phoneme Segmentation Fluency. for this student, it's the difference between being at "K-End" benchmark or the "1-Mid" benchmark. Based on that, we should spot check these some more, and see how prevalent they are.New Bedford
For new bedford, they use
assessment_subject
to store things like "PSF-Phoneme Segmentation" and just put "DIBELS" in the "assessment_name" field. And since we consider the "subject" part of the Assessment record, but throw it away during import inDibelsRow
, we don't have this information.We should fix this, by either uploading directly to insights in a particular format, or rewriting
DibelsRow
and having it work differently by district, while also splittingDibelsAssessment
out into its own Insights table.Also, New Bedford DIBELs data isn't working correctly yet for another reason. There's a validation on StudentAssessment and it says "unique data point for student and assessment_id"
The
Dibels#find_assessment_id
code relies on this, usingfind
...but since there's only one DIBELS
Assessment
in Insights......and NB has different dibels data points, all on the same day...
...this collapses all kinds of data points on top of each other...
...and when combined with #find_assessment_id...
This means that the importer reads a row, finds the assessment, blows away the scores. And then does this for each DIBELS record for a student. If the ordering is consisent and defined, this would be idempotent But it's not necessarily, so depending on the sort order of the export, we could get trashing on these scores on every import job. Email KR for a test case.
The text was updated successfully, but these errors were encountered: