Allow runners to update their own system information. Runners can now
send back os_name, os_version and agent_id as part of a POST to
CALLBACK_URL/system-info/.
The goal is to get better information about the OS that is actually
running on the runner and to move the agent_id from the status updates
to the system-info callback.
The status updates should be used only to send back info about the status of
the installation process.
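For illustration, the payload a runner sends to the new endpoint could
be modeled roughly like this in Go; the JSON field names follow the
description above, but the struct itself is a sketch rather than the
actual GARM API type:

    // SystemInfo is an illustrative shape for the body a runner POSTs to
    // CALLBACK_URL/system-info/ using its instance token.
    type SystemInfo struct {
        OSName    string `json:"os_name"`
        OSVersion string `json:"os_version"`
        AgentID   int64  `json:"agent_id"`
    }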
Signed-off-by: Gabriel Adrian Samfira <gsamfira@cloudbasesolutions.com>
Canonical has relicensed the LXD project under the AGPLv3. This means
that we can no longer update the Go LXD client without relicensing GARM
as AGPLv3, which is neither desirable nor possible.
The existing code appears to be Apache 2.0 and all code that has
already been contributed stays Apache 2.0, but new contributions from
Canonical employees will be AGPLv3.
We cannot risk including AGPLv3 code now or in the future, so we will
separate the LXD provider into its own project, which can be AGPLv3.
GARM will simply execute the external provider.
If the LXD client code is ever split from the main project and
relicensed under Apache 2.0 or a compatible license, we will reconsider
adding it back as a native provider. In the long run, though, I believe
external providers will be the only option, as they are easier to
write, easier to maintain and safer to ship (a bug in the provider does
not crash GARM itself).
Signed-off-by: Gabriel Adrian Samfira <gsamfira@cloudbasesolutions.com>
On large deployments with many jobs, we cannot check every job we
recorded in the DB against the GH API.
Before this change, if a job had not been updated in more than 10
minutes, garm would check against the GH API whether that job still
existed. While this approach allowed us to maintain a consistent view
over which jobs still exist and which are stale, it had the potential
to spam the GH API, leading to rate limiting.
This change uses the scale-down loop as an indicator of job staleness.
If a job remains in queued state in our DB but has disappeared from GH,
or was serviced by another runner and we never got the hook (garm was
down or GH had an issue - it has happened in the past), then garm will
spin up a new runner for it. When that runner, or any other runner, is
scaled down, we check whether we have jobs in the queue that should
have matched it. If we do, there is a high chance that the job no
longer exists in GH and we can remove it from the queue.
Of course, there is a chance that GH is having issues and the job is never
pushed to the runner, but we can't really account for everything. In this case
I'd rather avoid rate limiting ourselves.
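To make the idea concrete, here is a rough Go sketch of the check
described above; the types and function names are illustrative only,
not GARM's actual internals:

    // Job is a simplified stand-in for the queued jobs GARM records in its DB.
    type Job struct {
        ID     int64
        Labels []string
    }

    // labelsMatch reports whether the runner advertises every label the job requests.
    func labelsMatch(runnerLabels, jobLabels []string) bool {
        have := make(map[string]bool, len(runnerLabels))
        for _, l := range runnerLabels {
            have[l] = true
        }
        for _, l := range jobLabels {
            if !have[l] {
                return false
            }
        }
        return true
    }

    // pruneStaleJobs drops queued jobs that the scaled-down runner should have
    // serviced; if it never picked them up, they most likely no longer exist on GH.
    func pruneStaleJobs(runnerLabels []string, queued []Job) []Job {
        var remaining []Job
        for _, job := range queued {
            if labelsMatch(runnerLabels, job.Labels) {
                continue // likely stale, drop it from the queue
            }
            remaining = append(remaining, job)
        }
        return remaining
    }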
Signed-off-by: Gabriel Adrian Samfira <gsamfira@cloudbasesolutions.com>
This change adds a flag on providers that allows users to disable JIT
configuration even when it's available. For context, JIT is available
on github.com and any GHES instance >=3.10.
This option is a stopgap measure for providers that have not yet been
updated to use JIT configs instead of runner registration tokens.
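As a rough sketch, the provider section of the config could end up
looking something like this; the exact key name used here
(disable_jit_config) is an assumption, only the behaviour is described
by this change:

    [[provider]]
      name = "my-external-provider"
      provider_type = "external"
      # Keep using runner registration tokens even when JIT configs are available.
      disable_jit_config = true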
Signed-off-by: Gabriel Adrian Samfira <gsamfira@cloudbasesolutions.com>
Some provider binaries need additional environment variables set (e.g.
Kubernetes, as client-go depends on the KUBERNETES_SERVICE_ vars), so
it should be possible to define a list of environment variables that
are passed through to the provider binary when it is executed.
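A hedged sketch of what this could look like in the external provider
config; the key name (environment_variables) and its placement are
assumptions based on the description above:

    [[provider]]
      name = "k8s"
      provider_type = "external"
      [provider.external]
        provider_executable = "/opt/garm/providers/garm-provider-k8s"
        config_file = "/etc/garm/k8s-provider-config.toml"
        # Host environment variables matching these names/prefixes are
        # passed through to the provider binary when it is executed.
        environment_variables = ["KUBERNETES_SERVICE_"]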
This branch adds the ability to forcefully remove a runner from GARM.
When the operator wishes to manually remove a runner, the workflow is as
follows:
* Check that the runner exists in GitHub. If it does, attempt to
remove it. An error here indicates that the runner may be processing
a job. In this case, we don't continue and the operator gets immediate
feedback from the API.
* Mark the runner in the database as pending_delete
* Allow the consolidate loop to reap it from the provider and remove it
from the database.
Removing the instance from the provider is async. If the provider errs
out, GARM will keep trying to remove it in perpetuity until the
provider succeeds. In situations where the provider is misconfigured,
this will never happen, leaving the instance in a permanent state of
pending_delete.
A provider may fail for various reasons: credentials may have expired,
the API endpoint may have changed, the provider may be misconfigured,
or the operator may simply have removed it from the config before
cleaning up the runners. While some cases are recoverable, some are
not. We cannot end up in a situation where we are unable to clean up
resources in garm because of a misconfiguration.
This change adds the pending_force_delete instance status. Instances
marked with this status will be removed from GARM even if the provider
reports an error.
The GARM CLI has been modified to give new meaning to the
--force-remove-runner option. This option is no longer mandatory.
Instead, setting it will mark the runner with the new
pending_force_delete status, while omitting it will mark the runner
with the old pending_delete status.
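For example, removing a runner could look roughly like this with the
garm-cli (the subcommand and runner name are illustrative):

    # Normal removal: marks the runner as pending_delete; provider
    # errors will block the cleanup.
    garm-cli runner rm garm-runner-0001

    # Forced removal: marks the runner as pending_force_delete; it is
    # removed from GARM even if the provider reports an error.
    garm-cli runner rm --force-remove-runner=true garm-runner-0001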
Fixes: #160
Signed-off-by: Gabriel Adrian Samfira <gsamfira@cloudbasesolutions.com>
This update pulls in the latest version of garm-provider-common which removes
its dependency on go-github, making future updates much less painful.
Signed-off-by: Gabriel Adrian Samfira <gsamfira@cloudbasesolutions.com>
Add info metrics about providers, enterprises, organizations,
repositories and pools.
Also expose most of the configurable pool information as metrics, e.g.
max runners as garm_pool_max_runners.
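For illustration, scraped output could look roughly like this; only
garm_pool_max_runners is named above, the other metric and label names
in this sample are assumptions:

    garm_provider_info{name="lxd_local",provider_type="external"} 1
    garm_pool_info{pool="f9d5c8",image="ubuntu:22.04",flavor="default"} 1
    garm_pool_max_runners{pool="f9d5c8"} 10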
Signed-off-by: Mario Constanti <mario.constanti@mercedes-benz.com>
We must create the DB entry for a runner with a JIT config included. Adding it later
via an update runs the risk of having the consolidate loop pick up the incomplete instance.
Signed-off-by: Gabriel Adrian Samfira <gsamfira@cloudbasesolutions.com>
When using JIT runners, we register the runner on GitHub before we get
a chance to spin up the instance in the provider. In such cases, we end
up with a runner in "offline" state while we're creating the actual resource
that will embody the runner. This change will give runners a chance to come
online before garm tries to clean them up.
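A rough Go sketch of the idea, with an illustrative grace period (the
value and function are assumptions, not GARM's actual code):

    import "time"

    // bootGracePeriod is illustrative; the interval GARM actually waits may differ.
    const bootGracePeriod = 5 * time.Minute

    // shouldReap reports whether an "offline" runner has been around long enough
    // that we can assume it failed to boot, rather than still being provisioned.
    func shouldReap(ghStatus string, createdAt, now time.Time) bool {
        if ghStatus != "offline" {
            return false
        }
        return now.Sub(createdAt) > bootGracePeriod
    }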
Signed-off-by: Gabriel Adrian Samfira <gsamfira@cloudbasesolutions.com>