snakemake pipeline #34
test to ensure sensitive things do not get uploaded
because cron gives us a clean environment. Create FTPS sentinel file.
Fix docformatter errors in place, and stop ruff and docformatter from fighting with each other.
skeating
left a comment
I couldn't get it running locally, and I certainly couldn't find a 'tools' directory as given in the develop.md file.
I think this could be filed as an issue to revisit if we have time; so not a blocker
Interesting, @thompson318, is that your script, and did you intend to commit it to this repo?
Implements #29 and fixes #26.
Snakemake is run according to a cron specification (default is once a day in the early hours of the morning).
It causes the CSVs output by the waveform-controller to be converted to both kinds of parquet, and then the de-id parquet to be uploaded to the DSH, leaving behind marker files with upload stats so that snakemake knows they are done, and so do the humans!
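The marker-file idea above can be sketched roughly as follows. This is only an illustration, not the code in this PR: the function name `write_upload_marker`, the JSON format, and the stats fields are all hypothetical. The marker is written to a temporary file and then renamed, so a half-written marker is never mistaken for a completed upload.

```python
import json
from pathlib import Path


def write_upload_marker(marker_path: Path, stats: dict) -> None:
    """Write upload stats to a marker file (hypothetical helper, not from this PR)."""
    marker_path.parent.mkdir(parents=True, exist_ok=True)
    # Write to a sibling temp file first, then rename into place, so the
    # marker only appears once its contents are complete.
    tmp_path = marker_path.with_suffix(marker_path.suffix + ".tmp")
    tmp_path.write_text(json.dumps(stats, indent=2, sort_keys=True))
    tmp_path.replace(marker_path)
```

A Snakemake rule could then declare the marker as its output, so the upload is only re-run when the marker is missing, and a human can open the same file to read the stats.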
Toy hasher needed a fix because Python's built-in `hash()` is not stable from run to run. (Switch to using real hasher is in #35.)
Also add a pipeline debugging guide.
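On the hash stability point: Python salts `hash()` for `str` and `bytes` per process unless `PYTHONHASHSEED` is set, so the same input can hash differently on each run. A minimal sketch of a run-stable toy hasher, assuming a truncated SHA-256 digest is acceptable for test purposes, is below. The name `stable_toy_hash` and the `length` parameter are hypothetical and are not taken from this PR.

```python
import hashlib


def stable_toy_hash(value: str, length: int = 8) -> str:
    """Return a short hex digest that is identical across interpreter runs.

    Hypothetical helper for illustration only; not suitable as a real
    de-identification hasher.
    """
    digest = hashlib.sha256(value.encode("utf-8")).hexdigest()
    return digest[:length]
```

Because the digest depends only on the input bytes, output filenames derived from it stay the same between cron runs, which matters when Snakemake decides what is already done by checking file paths.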