Meeting Minutes for Week 3 #51

SoloSynth1 · 2024-05-10T23:06:51Z

tonyshumlh · 2024-05-15T21:28:03Z

tonyshumlh · 2024-05-16T20:04:04Z

Possible Issue for Checklist:

Convenience vs Version Control

Possible Issue for the Application:

Error handling on the variation of LLM response (e.g. some JSON might not be easily parsed)
Error handling on the truncation of LLM response (e.g. LLM might miss to output the evaluation test of part of the checklist items)

tonyshumlh · 2024-05-17T19:59:40Z

Partner Meeting - 2024/05/17 Week 3

Comment
If a leader/teacher evaluates, they might not know the path of test function/file. Better if the report provides the path and line number for easier review -> extract line number with scripts
Need more information for partial/non-satisfied checklist items, e.g. add function name and line number per each test file
(Good to Have) output HTML report for more detailed report than report in CLI -> refer to DSCI522 pytest coverage session
Need error handling when there is no test file / function, e.g. identify edge case, raise error, give 0 scores and give the checklist in human readable format
Use Regression to evaluate if a parameter (X) is associated with consistency - completeness score (Y)
For consistency, We can do 1) prompt engineering on checklist, 2) show the explanation and/or uncertainty
We can come back to Consistency after functionality development and prompt engineering
Depth vs Breadth on the system: 1) If checklist-oriented, focus on 1 or a few repo and revise and enrich the checklist; 2) focus on certain test area and apply to multiple repos; focus more on 2) in system dev
Checklist format: confirmed to use CSV, then enable to convert into QMD/HTML for view
For Product 1.0, it is important for user to read the checklist instead of editing it
Might need a converter to convert code checklist to human readable checklist (HTML,PDF)
CSV can be embedded into QMD/HTML file, user can use Pandas to make table
Open Github Issue for Tiffany to add/review the (3-5) checklist item + Slack message, e.g. review N-th item in the website
For Checklist Citation, can put "General Knowledge" for common sense items and Tiffany will review it

tonyshumlh · 2024-05-17T20:30:07Z

JohnShiuMK · 2024-05-20T07:33:48Z

Partner Meeting Minutes - May 17, 2024

Attendees: John, Orix, Tiffany (Partner), Tony, Yingzi

Key Points Discussed:
System for Researcher Persona

Evaluation Report Output:
- Include the path and line number of related functions for each checklist item
- Provide more elaboration behind partial/non-satisfied checklist items (from a teacher’s perspective)
- Render the report (including score, summary, and breakdown) into HTML format
- Refer to examples from DSCI522 Pytest coverage in HTML format
Edge Case Handling:
- Example 1: If a repository has no test files or functions, the system may output a message like "there are no test cases in this repo."
- Example 2: Detect and handle cases where the project is not related to Machine Learning
Focused Development:
- Focus on 3-5 checklist items
- (First) Build the system in depth based on these items using one repository (lightfm)
- (Then) Apply the system to 4-5 other repositories.

System Evaluation for Ourselves (System Developer Persona)

"Completeness Score" Consistency Metrics:
- Examine the Consistency improvement using a regression model (Response: Y = Consistency; Explanatory variable: X = the System Change)
- Continue working on prompt engineering to minimize uncertainty; and/or,
- Consider outputting the uncertainty of the Score/Evaluation along with the report and explanation

Checklist for Leader Persona

Checklist Format and Visualization:
- Confirmed to use CSV format of the checklist as the single source of truth
- As a version 1.0 of the System, we will focus on users reading the checklist instead of editing it
- Convert the checklist CSV(s) into a human-readable HTML format using Pandoc or Quarto to facilitate visualization
Checklist Collaboration:
- Use Github issues + Slack for communication with Tiffany
- Focus each communication on 3-5 checklist items or one area of items instead of the entire checklist
- For checklist citation, use "General Knowledge" for common sense items. Tiffany will review these citations

JohnShiuMK · 2024-05-20T08:09:19Z

Proceed to #72

JohnShiuMK changed the title ~~Sprint Planning - 2024/05/13 Week 3~~ Meeting Minutes for Week 3 May 13, 2024

JohnShiuMK added the admin meeting related label May 13, 2024

JohnShiuMK assigned SoloSynth1, JohnShiuMK, tonyshumlh and jinyz8888 May 20, 2024

JohnShiuMK closed this as completed May 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Meeting Minutes for Week 3 #51

Meeting Minutes for Week 3 #51

SoloSynth1 commented May 10, 2024 •

edited

tonyshumlh commented May 15, 2024 •

edited by jinyz8888

tonyshumlh commented May 16, 2024

tonyshumlh commented May 17, 2024 •

edited

tonyshumlh commented May 17, 2024 •

edited by SoloSynth1

JohnShiuMK commented May 20, 2024 •

edited

JohnShiuMK commented May 20, 2024

Meeting Minutes for Week 3 #51

Meeting Minutes for Week 3 #51

Comments

SoloSynth1 commented May 10, 2024 • edited

tonyshumlh commented May 15, 2024 • edited by jinyz8888

tonyshumlh commented May 16, 2024

tonyshumlh commented May 17, 2024 • edited

tonyshumlh commented May 17, 2024 • edited by SoloSynth1

JohnShiuMK commented May 20, 2024 • edited

JohnShiuMK commented May 20, 2024

SoloSynth1 commented May 10, 2024 •

edited

tonyshumlh commented May 15, 2024 •

edited by jinyz8888

tonyshumlh commented May 17, 2024 •

edited

tonyshumlh commented May 17, 2024 •

edited by SoloSynth1

JohnShiuMK commented May 20, 2024 •

edited