polito.it
Politecnico di Torino (logo)

Automating the Extraction of Professional Links from Law Firm Websites

Harsh Lalitbhai Vasoya

Automating the Extraction of Professional Links from Law Firm Websites.

Rel. Daniele Apiletti. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2025

[img]
Preview
PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (5MB) | Preview
Abstract:

The goal of this project is to create an automated method for extracting connections from professional profiles on law firm websites. Because these websites have different architectures, a versatile web scraping solution was created utilizing Cloudscraper, BeautifulSoup, Requests, and Selenium to guarantee accuracy and flexibility. For convenience and organization, the retrieved links are methodically saved in CSV files.

Relatori: Daniele Apiletti
Anno accademico: 2024/25
Tipo di pubblicazione: Elettronica
Numero di pagine: 53
Soggetti:
Corso di laurea: Corso di laurea magistrale in Data Science And Engineering
Classe di laurea: Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA
Aziende collaboratrici: TALENT ACQUISITION PARTNER SRL
URI: http://webthesis.biblio.polito.it/id/eprint/35368
Modifica (riservato agli operatori) Modifica (riservato agli operatori)