Career | <?phpecho $jobTitle;?> | <?phpecho $companyName;?>

Data Engineer-Web Scraping

Cyble

Bangalore, IN / Karnataka, IN / Onsite/Remote
  • Job Type: Full-Time
  • Function: Data Science
  • Industry: AI/ML
  • Post Date: 07/03/2024
  • Website: cyble.io
  • Company Address: 1175 Cicero Dr, Alpharetta, Georgia 30022, US

About Cyble

Cyble provides capabilities for customers to manage cyber risks with AI-powered actionable threat intelligence. We are specialists in gathering intelligence across the Deepweb, Darkweb and Surface Web

Job Description

Cyble provides the fastest and most comprehensive coverage across adversaries, infrastructure, exposure, weaknesses, and targets.

Cyble empowers governments and enterprises to safeguard their citizens and infrastructure by providing critical intelligence in a timely manner and enabling rapid detection, prioritization, and remediation of security threats through its advanced capabilities for data analysis, expert insights, and automated processes.

Headquartered in Alpharetta, Georgia, and with offices in Australia, Malaysia, Singapore, Dubai, Saudi Arabia and India, Cyble has a global presence. To learn more about Cyble, visit www.cyble.com.

  • Responsible for Creating and managing website scraping configurations on web
scraping tool.
  • Responsible for monitoring scraping configurations for potential errors and
blockages.
  • Responsible for monitoring data being scraped to identify potential issues and
blockages.
  • Responsible for coordinating with stakeholders to understand scraping task
requirements and reporting issues.
  • Responsible for preparing and sharing periodic scraping activity reports with
stakeholders.
  • Minimum 3-year experience of working as a data collection and quality
engineer/lead.
  • Should have hands-on experience of building and managing scraping configurations
for large number of websites.
  • Should have hands-on experience of working with third-party web scraping tools.
  • Should have hands-on experience of working with open-source web scraping libraries

(e.g Scrapy, Selenium etc.)
  • Should have good understanding of data ingestion and processing pipelines.
  • Should have hands-on experience of implementing and managing data quality

checks.
  • Should be familiar with programming Languages like Python and Go.
  • Should be Fluent with web technology concepts like HTML, DOM, CSS, XPATH etc.
  • Should be familiar with usage of regular expressions for data selection and cleaning

purpose.
  • Should be familiar with Windows and Linux Operating systems and general

networking concepts.

INR ₨1,000,000.00 - INR ₨2,000,000.00 /Yr.

We use cookies to customize your user experience. Click “Agree” if you agree with our Policy.