You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
After developing the updated test.py command for Ersilia #1203 , I believe having a detailed report for model contributors and users would be helpful in debugging and understanding the models in context.
The testing report summary, along with including the metrics measured in the test.py command should also include the following (not an exhaustive list):
Inuputs that were tested, inputs that raised specific errors
Software Dependencies: List of all libraries and tools used, including versions
Espeically helpful if users want to run and download the models locally
Warnings if models don't work on certain operating systems (MacOS, Windows, etc...)
Performance Tests:
metrics such as speed, memory usage
Output checks, If errors are raised or ouputs are invalid, describe them in context:
Missing Values
Outliers
Formatting Errors
The text was updated successfully, but these errors were encountered:
Thanks @kurysauce @DhanshreeA to bring you up to speed, this is not something we need to tackle immediately but it is a nice feature request from @kurysauce . Let's leave it in out to-do list for now.
Update check_consistent_output and comparison between bash & Ersilia run in run_bash methods to allow comparisons for data structure containing different output types. Current method only supports comparisons between outputs that are all numerical values and all strings.
Describe your feature request.
After developing the updated
test.py
command for Ersilia #1203 , I believe having a detailed report for model contributors and users would be helpful in debugging and understanding the models in context.The testing report summary, along with including the metrics measured in the
test.py
command should also include the following (not an exhaustive list):The text was updated successfully, but these errors were encountered: