I’m working on a project that requires a dataset of small, self-contained Python files that are known to be bug-free. Ideally, these files would represent complete, functional units of code, not just snippets.
Specifically, I’m looking for:
Self-contained Python files: Each file should be runnable on its own, without external dependencies (beyond standard libraries, if necessary). Bug-free: The files should be reasonably well-tested and known to function correctly. Small to medium size: I’m not looking for massive projects, but rather individual files that demonstrate good coding practices. Optional but desired: Unit tests attached to the files would be a huge plus!
I want to use this dataset to build a static analysis tool. I have been looking for GitHub repositories that match this description. I have tried the leetcode dataset but I need more than that.
Thank you 🙂
submitted by /u/Serious-Aardvark9850
[link] [comments]