1001 Freelance Projects
Latest Projects from Freelance Marketplaces
Today is: 31-Mar-2025 02:59 GMT
View Project
Project title: AI specialist for advanced scraping tool for housing websites
Posted by: External project from PeoplePerHour
Started: 17-Dec-2024 12:34 GMT
Description: I am looking for an experienced AI specialist to develop a Windows Service in C# that does the following:
Every day, visit a list of approximately 800 URLs of real estate agency websites and navigate through the pages to search for newly listed properties added by the agencies.

Next, these property pages must be read, and the relevant data extracted to be stored in a fixed format in tables on an SQL server.
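
As a rough illustration of the daily "find only the new listings" step, here is a minimal sketch. It is written in Python for brevity (the service itself is to be built in C#), and it uses an in-memory SQLite database as a stand-in for the SQL server. The table name `listings`, the column names, and both function names are assumptions made for this example. The page URL serves as the key because it is the field used to enforce uniqueness.

```python
import sqlite3

def open_store(path=":memory:"):
    """Open a database holding one row per property page URL (stand-in for the SQL server)."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS listings ("
        " url TEXT PRIMARY KEY,"        # uniqueness is enforced on the page URL
        " first_seen TEXT NOT NULL)"
    )
    return conn

def record_new_urls(conn, urls, seen_on):
    """Insert URLs that are not stored yet and return only those new ones."""
    new_urls = []
    for url in urls:
        cur = conn.execute(
            "INSERT OR IGNORE INTO listings (url, first_seen) VALUES (?, ?)",
            (url, seen_on),
        )
        if cur.rowcount == 1:  # 0 means the URL was already known and was skipped
            new_urls.append(url)
    conn.commit()
    return new_urls
```

Only the URLs returned by `record_new_urls` would then need to be fetched and parsed, which keeps the daily run over roughly 800 sites from re-reading property pages that are already stored.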

A number of data fields are mandatory, such as:

The direct URL of the property page within the real estate agency's website (to enforce uniqueness)
The city where the property is located
The street where the property is located
The property type, chosen from our fixed list (entire home, apartment, studio, etc.); the engine must select the closest match from that list
The number of rooms
The monthly rental price
Whether this price includes or excludes service charges
The date the property is available
The surface area in square meters
A list of URLs of the photos associated with the property
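
The "closest match from our list" requirement for the property type can be sketched with plain string similarity. This is only a baseline under stated assumptions: the list below contains just the three types named above (the full list is not given here), the function name and the cutoff of 0.6 are hypothetical, and an AI model would likely be needed for descriptions that share no wording with the list. Python is used for brevity; the service itself is to be written in C#.

```python
import difflib

# Assumption: only the three types named in the description; the real list is longer.
PROPERTY_TYPES = ["entire home", "apartment", "studio"]

def closest_property_type(raw, choices=PROPERTY_TYPES, cutoff=0.6):
    """Return the entry in choices most similar to raw, or None if nothing is close enough."""
    cleaned = raw.strip().lower()
    matches = difflib.get_close_matches(cleaned, choices, n=1, cutoff=cutoff)
    return matches[0] if matches else None
```

Returning `None` instead of forcing a match lets the caller flag the listing for a second pass (for example by a language model) rather than storing a wrong type.
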
Additionally, there is a list of optional fields we would like to retrieve if the information is available:

Municipality
District
Postal code
House number
Number of bedrooms
Number of bathrooms
Year of construction
Is there a: garden, garage, rooftop terrace, balcony?
Condition of the property
Is the property furnished?
...and so on
A complete list will be provided.
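
To make the split between mandatory and optional fields concrete, here is a minimal sketch of a record type with a completeness check. All field names, the types, and the choice to treat an empty photo list as missing are assumptions for illustration; only a few of the optional fields are shown because the complete list is not part of this description. Python is used for brevity; the service itself is to be written in C#.

```python
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class Listing:
    # Mandatory fields named in the description (None means "not extracted yet").
    url: Optional[str] = None
    city: Optional[str] = None
    street: Optional[str] = None
    property_type: Optional[str] = None
    rooms: Optional[int] = None
    monthly_rent: Optional[float] = None
    service_charges_included: Optional[bool] = None
    available_from: Optional[str] = None   # assumed to be an ISO date string
    surface_m2: Optional[float] = None
    photo_urls: List[str] = field(default_factory=list)
    # A few of the optional fields.
    postal_code: Optional[str] = None
    bedrooms: Optional[int] = None
    furnished: Optional[bool] = None

MANDATORY = (
    "url", "city", "street", "property_type", "rooms", "monthly_rent",
    "service_charges_included", "available_from", "surface_m2", "photo_urls",
)

def missing_mandatory(listing):
    """Return the names of mandatory fields that are None, empty text, or an empty list."""
    return [name for name in MANDATORY if getattr(listing, name) in (None, "", [])]
```

A listing would be written to the database only when `missing_mandatory` returns an empty list; otherwise the missing names indicate what a retry or a manual review has to fill in. Note that `False` for `service_charges_included` counts as a valid value, not as missing.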

The challenge lies in the fact that each real estate agency uses a different paging method and different page layouts. Furthermore, some agencies include all the information in one block of text, while others display much of the data in columns. This can also change unexpectedly. Therefore, the software must be resilient and capable of understanding how to navigate through the pages to look for new properties.

A second challenge is that some agencies include photos of other nearby properties under the details of a specific property. The tool must recognize that these photos do not belong to the property in question and should ignore them.

For cost reasons, we would prefer an AI model that does not rely on a commercial API, unless a commercial API offers benefits significant enough to justify the expense.

I would love to hear about your experience and how you would approach this. Specifically: which AI method/engine you would use and the flow of the software.
Project ID: 3413051
Copyright © 2005-2024 1001 Freelance Projects