1001 Freelance Projects
Latest Projects from Freelance Marketplaces
Today is: 16-May-2024 07:32 GMT
View Project
View this project in detail (Note: you will be redirected to external marketplace)
Project title: Amazon scraper +WP All Import Plugin
Posted by: External project from Upwork
Started: 19-Feb-2021 10:17 GMT
Description: This is a new updated/revised job description.


IN BRIEF:


I have installed WP All Import (WPAI) on my Wordpress website

https://www.wpallimport.com/


My site uses Advanced Custom Fields (ACF) so I have also installed the ACF Addon for WPAI

https://www.wpallimport.com/advanced-custom-fields/


My site is on a webhosting vps account over at webhostpython com


My website is a book deals website, which lists the best book deals every day


I find approx 1/3rd of the deals listed on a 3rd party book site: website 'x'


I need a scraper that will visit website 'x', grab its list of books (and some details), then visit each book's Amazon page, and scrape each book's details from Amazon


The scraper will then drop a csv file on my server with ALL the scraped details (from both sites)


In Summary, there are 3 steps:


# 1 visit website 'x', collect book details

#2 visit Amazon pages for each book and scrape Amazon book details

#3 create updateable csv file on server


THEN

Ensure WP All Import captures details/fields correctly (see below, most of this is already done)


IN DETAIL:

:

#1 Website 'x' does not have an RSS feed but updates daily.

= therefore, when the scraper/script visits the site it must make a record of ALL books scraped so it will ONLY scrape new books (today's books) each day.


NOTE: Website 'x' is a privately owned website, I do not want to hammer their resources or be flagged in any way, EG: Be respectful, I don't have permission to scrape it.


#2 The site is only a 'listing' of books, with links to book pages on Amazon, so the scraper must go to the site, collect links to Amazon, then visit those links and scrape each Amazon page.

+ scraper will need to visit multiple pages on the site


I have manually captured the required details from BOTH site 'x' and Amazon and created a sample csv file which you can use = all columns correctly labelled etc


I will provide EXACT details of what to scrape, with marked up screenshots, all communications will be very clear and easy to understand.


I don't mind what platform is used to build the scraper/submitter (php, python, etc) so long as it scrapes and submits.


WP All Import:


If you have experience with WP All Import, that is good, but probably not necessary. I have been able to figure out 90% of the necessary settings.


There are a couple of settings I cannot figure out, which maybe need a php function added, I would like you to check/edit the settings to ensure the import runs smoothly/correctly.


EG #1 I can't figure out how to match categories from csv to categories on my site!


EG #2: A custom field needs to display a date, and I don't know how to do that


EG#3: Some book titles need small text changes such as this title:

'Hunt for Justice Box Set: Books 1–2'

= IF a book title contains a colon ':' THEN remove the colon and ALL that follows it

So, title will now display as: 'Hunt for Justice Box Set'

ALSO

= IF a book title contains the words 'Box Set' THEN change 'Box' to 'Boxed' and surround words with brackets

So, title will now display as 'Hunt for Justice (Boxed Set)

ALSO

= If a title contains the words 'Omnibus', or 'Complete Omnibus', or 'The Complete Ominibus' THEN change to 'Omnibus' and surround with brackets

So, title will display as 'book title (Omnibus)


= everything else I have done.


Additional info:


I can provide some additional precise details to the person who creates this. I have worked with a few developers for this kind of scraper / submitter, so I already know most of the details / issues which relate to it so this is a very simple job for someone who knows what they are doing.


Future Projects:


There are also another 3-5 sites I want to scrape in the same way, so there may be additional similar projects for the right worker.


Please note, these are book sites, they are not high-earners. I do not have large budgets for any of my work. So I am looking for low cost solutions. But I write very good reviews for good workers!


Please write to me with an accurate assessment of how much time this would take you, when you can start and how much you will charge. I will ensure I provide as many details as possible, so the job doesn't have any 'unexpected surprises'.


Thank you.

Budget: $50

Posted On: February 19, 2021 10:17 UTC
Category: Data Extraction
Skills:Web Scraper, Python, PHP, Data Scraping

Skills: Web Scraper, Python, PHP, Data Scraping
Country: United Kingdom

click to apply
Project ID: 3142626
Project category: Web Scraper, Python, PHP, Data Scraping
Project budget: $50
View this project in detail (Note: you will be redirected to external marketplace)
Last Projects / Browse Projects
  Project Started
Minimalist Logo Design
Category: 3D Design, Graphic Design, Illustration, Logo Design, Photoshop
Budget: ₹600 - ₹1500 INR
16-May-2024
04:04 GMT
iOS Form App for Returns Inspection
Category: IPad, IPhone, Mobile App Development, Objective C, Swift
Budget: $30 - $250 AUD
16-May-2024
04:04 GMT
Online Booking System for Massage Center
Category: Web Hosting, Website Management
Budget: $10 - $30 USD
16-May-2024
04:02 GMT
Casual Men's Shirt Graphic Design
Category: Graphic Design, Logo Design, Photoshop, Photoshop Design, T Shirts
Budget: $250 - $750 USD
16-May-2024
04:02 GMT
Minimalistic Illustration for Brand Awareness
Category: Caricature & Cartoons, Graphic Design, Illustration, Logo Design
Budget: $14 - $30 NZD
16-May-2024
04:02 GMT
Architectural Structure Analysis
Category: Building Architecture, Civil Engineering, Structural Engineering
Budget: $10 - $30 USD
16-May-2024
03:57 GMT
Advanced Real Estate Image Retouching
Category: Adobe Lightroom, Graphic Design, Photo Editing, Photography, Photoshop
Budget: $10 - $100 USD
16-May-2024
03:56 GMT
Urgent: Set Up Cloudpanel on VPS
Category: Apache, Linux, MySQL, System Admin, UNIX
Budget: $10 - $30 USD
16-May-2024
03:56 GMT
Desarrollador Joomla
Category: CSS, Drupal, HTML, Joomla, PHP
Budget: $2 - $8 USD
16-May-2024
03:56 GMT
Professional Email Configuration on Registered Domain
Category: Email Handling, Website Management
Budget: ₹100 - ₹400 INR
16-May-2024
03:53 GMT
Dual Social Media Marketing Campaign
Category: Facebook Marketing, Instagram Marketing, Social Media Marketing
Budget: $30 - $250 NZD
16-May-2024
03:53 GMT
Urgent Data Entry Freelancer Needed
Category: Data Entry, Data Processing, Excel, Virtual Assistant, Web Search
Budget: $15 - $25 USD
16-May-2024
03:53 GMT
Discord Video Tutorial Creation
Category: Discord, Discord API
Budget: $2 - $8 AUD
16-May-2024
03:52 GMT
Home Network SonicWall Firewall Setup
Category: Firewall, Network Administration, Network Engineering, Network Security, System Admin
Budget: $30 - $250 USD
16-May-2024
03:50 GMT
page builder is wp bakery WordPress Website Design and Content Update
Category: Graphic Design, HTML, PHP, Web Design, WordPress
Budget: ₹1500 - ₹12500 INR
16-May-2024
03:49 GMT
Browse All Projects
Projects by Skills ...
Projects for 'android'
Projects for 'ajax'
Projects for 'asp'
Projects for 'aspnet'
Projects for 'cms'
Projects for 'cpp'
Projects for 'csharp'
Projects for 'css'
Projects for 'delphi'
Projects for 'design'
Projects for 'drupal'
Projects for 'excel'
Projects for 'facebook'
Projects for 'flash'
Projects for 'html'
Projects for 'java'
Projects for 'javascript'
Projects for 'joomla'
Projects for 'iphone'
Projects for 'mysql'
Projects for 'photoshop'
Projects for 'php'
Projects for 'python'
Projects for 'ruby'
Projects for 'seo'
Projects for 'sql'
Projects for 'sysadm'
Projects for 'translate'
Projects for 'typing'
Projects for 'twitter'
Projects for 'vbnet'
Projects for 'xml'
Projects for 'wordpress'
Projects for 'writing'
Read RSS feeds ... New!
RSS feed for 'android'
RSS feed for 'ajax'
RSS feed for 'asp'
RSS feed for 'aspnet'
RSS feed for 'cms'
RSS feed for 'cpp'
RSS feed for 'csharp'
RSS feed for 'css'
RSS feed for 'delphi'
RSS feed for 'design'
RSS feed for 'drupal'
RSS feed for 'excel'
RSS feed for 'facebook'
RSS feed for 'flash'
RSS feed for 'html'
RSS feed for 'java'
RSS feed for 'javascript'
RSS feed for 'joomla'
RSS feed for 'iphone'
RSS feed for 'mysql'
RSS feed for 'photoshop'
RSS feed for 'php'
RSS feed for 'python'
RSS feed for 'ruby'
RSS feed for 'seo'
RSS feed for 'sql'
RSS feed for 'sysadm'
RSS feed for 'translate'
RSS feed for 'typing'
RSS feed for 'twitter'
RSS feed for 'vbnet'
RSS feed for 'xml'
RSS feed for 'wordpress'
RSS feed for 'writing'
New!
Проекты на русском
(Projects in Russian)

Long URL:
www.1001freelanceprojects.com
Mobile version:
m.1001fp.com
Copyright © 2005-2022 1001 Freelance Projects