question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Error deploying AWS EC2 runner in Gitlab

See original GitHub issue

Hey everyone, I am not being able to use the cml runner in my gitlab CI pipeline.

I forked the repo: https://gitlab.com/iterative.ai/cml-cloud-case for a quick test before setting the pipeline in my actual project.

I setted the CI/CD variables needed: image I setted both repo_token and REPO_TOKEN because it says lower case in Gitlab’s repo README and upper case in CML Documentation.

And I am getting this error: job.log For me it looks like the job is not getting a “heartbeat” from the runner deployed.

I tried some different machines but right now my gitlab-ci.yml is like this:

stages:
  - deploy
  - train

deploy_job:
  stage: deploy
  when: always
  image: dvcorg/cml
  script:
    - cml-runner
      --cloud aws
      --cloud-region us-east
      --cloud-type t2.micro
      --cloud-hdd-size 64
      --labels=cml-runner-gpu

train_job:
  stage: train
  when: on_success
  image: dvcorg/cml-py3
  tags:
    - cml-runner-gpu

  script:
    - apt-get update -y
    - apt-get install python3-dev -y
    - pip install -r requirements.txt
    - cat gpu_info.txt >> report.md
    
    - cml-send-comment report.md 

One weird thing is that the machine is being created, but the job is falling. image

If I could provide any other relevant information please let me know!

Thank you all in advance!

Issue Analytics

  • State:closed
  • Created 2 years ago
  • Comments:10 (7 by maintainers)

github_iconTop GitHub Comments

3reactions
dacbdcommented, Dec 6, 2021

You can target a specific vpc by creating a security group for it and then use the aws-security-group option. You could follow https://github.com/iterative/cml/pull/831 as it may be relevant to your setup.

1reaction
dacbdcommented, Dec 4, 2021

\u001b[32m 😭https://github.com/iterative/cml/pull/818 [^1]

[^1]: not yet released in image

Read more comments on GitHub >

github_iconTop Results From Across the Web

Deploy and Manage Gitlab Runners on Amazon EC2 - AWS
To deploy the Gitlab Runner stack: · Obtain the runner registration tokens of the Gitlab projects that you want registered to the Gitlab...
Read more >
Gitlab pipeline error With CD/CI for AWS ec2 debian instance
The error indicates the runner won't start because you've configured a user for the runner to run as, but you can only do...
Read more >
Gitlab docker runner on AWS EC2 started failing abruptly
Our builds using docker+machine executor started failing abruptly, our machines are on AWS EC2 instances. There were no changes to the ...
Read more >
CICD fails to execute jobs on EC2 instances - GitLab Forum
The GitLab runner is configured to spawn a new AWS EC2 instance to run CICD jobs. ... ERROR: Job failed (system failure): exit...
Read more >
Autoscaling GitLab Runner on AWS EC2
In this tutorial, we'll explore how to properly configure GitLab Runner in AWS. The instance in AWS will serve as a Runner Manager...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found