Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Scraped position data incorrect when user has more than one position in same company

See original GitHub issue

The profile shown here has multiple roles at the same company, “Excelian | Luxoft Financial Services”.

Screen Shot 2020-06-09 at 7 45 46 pm

In the scraped profile, both of these roles are collapsed into a single one, where the company name is listed as the title and no companyName field is present. The dates shown are from the first role, but the description is from the second.

positions: [
    {
      title: 'Excelian | Luxoft Financial Services',
      link: 'https://www.linkedin.com/company/excelian/',
      url: 'https://www.linkedin.com/company/excelian/',
      location: 'Sydney, Australia',
      description: 'Mxml developer focused on confirmation work',
      date1: 'Sep 2016 – Present',
      date2: '3 yrs 10 mos'
    },

Issue Analytics

State:
Created 3 years ago
Comments:7 (5 by maintainers)

Top GitHub Comments

2reactions

leonardiwagnercommented, Jun 23, 2020

@grapevineai all of the scrapping is based on profileScraperTemplate.js:

template is an array of LinkedIn sections object, each object will be translated to be the final scrapedin result to the user, it has basically 2 properties:
- selector: a css selector which is an HTML container element which represents a LinkedIn section, such as Education or About.
- fields: it can be two options:
  - String: a CSS selector that will actually scrap an HTML element value to store and return to the user. It can be two options:
  - Object: will do recursively do the same job in the current section. It’s necessary when we have “elements inside other elements” such as roles from LinkedIn’s positions/experiences

It’s all based on CSS selectors as you can see, it’s not hard. The rest of the code is mostly to read that template and it barely has bugs. Thank you for your contribution, I spend my time here doing this project without earning a single penny so it’s good to have help, thank you 😄

1reaction

leonardiwagnercommented, Jun 23, 2020

That was fixed on v1.0.20 please update your version, feedback is appreciated! Thanks for reporting that @web-goddess

Top Results From Across the Web

python - I get the same job title for multiple jobs when using ...

The scraper works well, however for different job positions it returns the same title, whereas they should be different pending on the job...

How do I stop Indeed from scraping my job postings?

If you know that Indeed is scraping your Lever-hosted jobs and you would like them to stop, you can do so by contacting...

Your Approach to Hiring Is All Wrong - Harvard Business Review

Cappelli explores what's wrong with today's recruiting and hiring and how to ... data indicates that the most common reason employees consider a...

Job posting ( JobPosting ) structured data for Job Search

Improve the job seeking experience on Google by reviewing how you can markup your job posting web pages with structured data.

Reporting Jobs on LinkedIn | LinkedIn Help

If you come across a job on LinkedIn that you find to be inappropriate, broken, or incorrect, you can flag the job to...

Troubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.

Start Free

Top Related Reddit Thread

No results found

Top Related Tweet

No results found

Top Related Dev.to Post

No results found

Scraped position data incorrect when user has more than one position in same company

Issue Analytics

Top GitHub Comments

Top Results From Across the Web

Top Related Medium Post

Top Related StackOverflow Question

Troubleshoot Live Code

Top Related Reddit Thread

Top Related Hackernoon Post

Top Related Tweet

Top Related Dev.to Post

Top Related Hashnode Post

TypeError: Cannot read property 'name' of undefined -- version "^1.0.20"

TypeError: Cannot read property 'replace' of undefined