-
Notifications
You must be signed in to change notification settings - Fork 394
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
June community gems #2501
Merged
Merged
June community gems #2501
Changes from all commits
Commits
Show all changes
21 commits
Select commit
Hold shift + click to select a range
746c59a
added a couple more questions and answers
flippedcoder 37b0d8a
added another question/answer
flippedcoder 31c46e5
added another question/answer
flippedcoder f634afc
added another question/answer
flippedcoder 66bd767
added last question
flippedcoder a9f5b7c
updated file name
flippedcoder f49900f
Merge branch 'master' into may-community-gems
flippedcoder 73ae0fe
fixed some typos and made minor updates
flippedcoder 8225553
Update content/blog/2021-06-25-june-21-community-gems.md
flippedcoder 3a3672e
Update content/blog/2021-06-25-june-21-community-gems.md
flippedcoder 0b1c6c7
Update content/blog/2021-06-25-june-21-community-gems.md
flippedcoder d34d30a
Update content/blog/2021-06-25-june-21-community-gems.md
flippedcoder 6c86290
addressed feedback
flippedcoder 475787b
Restyled by prettier (#2569)
restyled-io[bot] 96fc64a
Merge branch 'may-community-gems' of https://github.com/iterative/dvc…
flippedcoder e8a3a3f
updated gems based on comments
flippedcoder 23c5ddd
updated commands
flippedcoder 5bc3aed
added links to PRs
flippedcoder 073b305
updated answer to address comments
flippedcoder f330918
minor formatting
flippedcoder 766d9b3
more minor clean up
flippedcoder File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,142 @@ | ||
--- | ||
title: June '21 Community Gems | ||
date: 2021-06-30 | ||
description: | | ||
A roundup of technical Q&A's from the DVC community. | ||
This month: DVC pipeline configs, working with remotes, | ||
file handling and more. | ||
descriptionLong: | | ||
A roundup of technical Q&A's from the DVC community. | ||
This month: DVC pipeline configs, working with remotes, | ||
file handling and more. | ||
picture: 2021-06-30/gems-cover.png | ||
author: milecia_mcgregor | ||
commentsUrl: https://discuss.dvc.org/t/june-21-community-gems/779 | ||
tags: | ||
- Community | ||
- Plots | ||
- Pipelines | ||
- CML | ||
- Git | ||
--- | ||
|
||
### [Q: Is it possible to plot multiple experiments together?](https://discord.com/channels/485586884165107732/563406153334128681/834387923482181653) | ||
|
||
You can use experiment names in the `dvc plots` commands. You need to use the | ||
`diff` command to compare multiple plots. Try | ||
`dvc plots diff exp-6ef18 exp-b17b4 exp-26e88`. | ||
|
||
Thanks to @PythonF from Discord for asking this question that led to this Gem! | ||
💎 | ||
|
||
### [Q: Where is the list of experiment being pushed in Git when I run `dvc exp push`?](https://discord.com/channels/485586884165107732/563406153334128681/837773937390649364) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Shouldn't |
||
|
||
It uses custom Git refs internally, similar to the way GitHub handles PRs. It’s | ||
a custom DVC Git ref pointing to a Git commit. Here's an example. | ||
|
||
```dvc | ||
$ git show-ref exp-26220 | ||
c42f48168830148b946f6a75d1bdbb25cda46f35 refs/exps/f1/37703af59ba1b80e77505a762335805d05d212/exp-26220 | ||
``` | ||
|
||
If you want to see your local experiments (that have not been pushed), you can | ||
run `dvc exp list --all`. | ||
|
||
You can read more about how we handle our custom Git refs in | ||
[this blog post](https://dvc.org/blog/experiment-refs). | ||
|
||
Thanks to @Chandana for asking this question about experiments! | ||
|
||
### [Q: Is there a way to list all the experiments I have on my DVC remote that have not been committed to Git?](https://discord.com/channels/485586884165107732/563406153334128681/836705209039978538) | ||
|
||
Yes! You can quickly look at all of the experiments in any repo with: | ||
|
||
```dvc | ||
$ dvc exp list --all <git repo URL> | ||
``` | ||
|
||
or | ||
|
||
```dvc | ||
$ dvc exp list --all <git remote> | ||
``` | ||
|
||
Thanks again @Chandana for this gem! | ||
|
||
### [Q: Is CML compatible with Azure DevOps?](https://discord.com/channels/485586884165107732/728693131557732403/841664412221177926) | ||
|
||
Another great question from @Chandana! | ||
|
||
Right now, we support GitHub and GitLab. | ||
|
||
Azure DevOps and GCP (Google Cloud Platform) support are on the roadmap. Stay | ||
tuned for more updates! | ||
|
||
You can stay up to date with our Azure DevOps progress on | ||
[this issue on GitHub](https://github.com/iterative/cml/issues/142). You can | ||
also follow along with GCP updates with | ||
[this issue](https://github.com/iterative/terraform-provider-iterative/issues/64). | ||
|
||
### [Q: I pushed a lot of files using `dvc push` to my DVC remote, but there are a few that couldn't be pushed at the time. If I run `dvc push` again, will it just upload the missing files?](https://discord.com/channels/485586884165107732/563406153334128681/842662337159757854) | ||
|
||
Thanks for the question @petek! | ||
|
||
Yes! You can just re-run `dvc push` and it will only upload the missing files. | ||
|
||
It might be a little slower than you would expect because DVC has to do some | ||
checks to make sure that the other files were uploaded successfully before, but | ||
as far as the actual data transfer goes, only the missing files will be | ||
uploaded. | ||
|
||
### [Q: Let's say I have a DVC pipeline with two stages, can I only pull the second one and keep the first one for other uses? Can I pull some specific output from the pipeline?](https://discord.com/channels/485586884165107732/485596304961962003/841688323663855616) | ||
|
||
You can pull specific outputs from a pipeline with | ||
`dvc pull path/to/specific/output`. This is similar to how you can use `dvc add` | ||
to work with specific files and directories. | ||
|
||
Thanks for such a great question @LucZ! | ||
|
||
### [Q: How does DVC handle incremental changes in the data and how does it work with non-DVC based pipeline features?](https://discord.com/channels/485586884165107732/485596304961962003/846364469524430848) | ||
|
||
These are good questions for common problems in MLOps from @Phoenix! | ||
|
||
To answer the first part, say you are getting new data every week. When you use | ||
DVC, you don't have to worry about getting duplicate data. | ||
|
||
DVC supports file-level deduplication right now, so if your data is in a shape | ||
of directory with files, then all unique files will only be stored once. | ||
Chunk-level deduplication is on our todo list. You can see how it's going in | ||
[this issue we have on GitHub](https://github.com/iterative/dvc/issues/829). | ||
|
||
For the second part of the question, you can use data management with DVC and | ||
have your own pipelines. Just treat it as Git for data then be sure to | ||
`dvc add`, `dvc push`, `dvc pull` and you should be set. Hooks, like | ||
`pre-commit` or `post-pipeline-run`, are a good way to go about it. | ||
|
||
### [Q: Is there a way to tell DVC to use a different profile instead of the default profile when interacting with S3?](https://discord.com/channels/485586884165107732/563406153334128681/846857498094469120) | ||
|
||
When you have a remote that is not on your default AWS profile and when you | ||
access it via the `awscli` using something like | ||
`aws s3 --profile=second_profile ls`, you'll need to update your remote config | ||
in DVC. | ||
|
||
You can run a command like: | ||
|
||
```dvc | ||
$ dvc remote modify myremote profile myprofile | ||
``` | ||
|
||
Check out the docs on `dvc remote modify` for all the remote config options. | ||
|
||
Great question @Avi! | ||
|
||
--- | ||
|
||
https://media.giphy.com/media/l0IycQmt79g9XzOWQ/giphy.gif | ||
|
||
At our July Office Hours Meetup we will be demo-ing pipelines as well as CML. | ||
[RSVP for the Meetup here](https://www.meetup.com/DVC-Community-Virtual-Meetups/events/279024694/) | ||
to stay up to date with specifics as we get closer to the event! | ||
|
||
[Join us in Discord](https://discord.com/invite/dvwXA2N) to get all your DVC and | ||
CML questions answered! |
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Gems
I see that this tag is present in the previous posts, should we add it here?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@flippedcoder I know we discussed this, but I'm not sure what is the end result
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@shcheklein I haven't seen that
Gems
tag in the previous posts, so I'd leave it out. Was there a particular post you're seeing this on?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@flippedcoder
Hmm .. This is from the https://dvc.org/blog/may-21-community-gems:
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I've been confused by tags too. Currently used ones:
alphabetical
frequency
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I see. I just made up the
Gems
tag for the last post, it wasn't there before. I wouldn't mind going back and addingGems
to all of the previous posts so that they get grouped together.Maybe we should always have
Gems
,Discord
,DVC
,CML
on the Community Gems posts and the others are dependent on the questions.What do y'all think about that? It doesn't seem like there have been any particular ways tags were used on these posts before so it'd be nice to standardize it a little.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah, I see. No worries. Let's merge it and may be even fix the previous post?