1001 Freelance Projects
Latest Projects from Freelance Marketplaces
Today is: 16-May-2024 19:59 GMT
View Project
View this project in detail (Note: you will be redirected to external marketplace)
Project title: 90% DONE PHP Amazon scraper +WP All Import Plugin
Posted by: External project from Upwork
Started: 01-Mar-2021 11:23 GMT
Description: Hello,


it's nuts that I have to write this job post, but here goes:


A developer has been doing some work for me, and is 95% - 99% finished, but now he says he doesn't have time to finish and has left the job. We were very close to 'handover' -- there are just a couple of outstanding issues, probably an hour of work left to do (or less!).


The job is a scraper that visits site#1 (a list of books) scrapes some details, then visits Amazon and scrapes some more details, then drops all the details into a csv file on my server.


That scraper has been done, and a sample data file is in the folder on my server.

It was created with php. It's finished -- but there are a couple of issues:


= I don't think the cron job is set up correctly, because the scraper isn't running each day

= the scrape results show some rows are missing info (only some of them) so the scraper is stopping its scrape occassionally, which needs to be corrected


THAT'S ALL


The second part of the job was to set up the WP All Import plugin to 'capture' the scrape details from the csv file. I have done 99% of it but there are a couple things I cannot figure out:


= I can't figure out how to 'match' the imported categories to the categories on my site (my tests are creating new categories and I don't want that)

= I can't figure out how to make all results set 'todays date' (current day's date) in the date fields (I think it's just a matter of adding a small snippet of php script to the field)

= And one other thing I can explain to the person doing this


THAT'S ALL


It's very surprising the developer dropped the job, as it really wouldn't take him much time to finish. But that's the situation.


-----------------------------------------------

ORIGINAL JOB DESCRIPTION BELOW:


I have installed WP All Import (WPAI) on my Wordpress website

https://www.wpallimport.com/


My site uses Advanced Custom Fields (ACF) so I have also installed the ACF Addon for WPAI

https://www.wpallimport.com/advanced-custom-fields/


My site is on a webhosting vps account over at webhostpython com


My website is a book deals website, which lists the best book deals every day


I find approx 1/3rd of the deals listed on a 3rd party book site: website 'x'


I need a scraper that will visit website 'x', grab its list of books (and some details), then visit each book's Amazon page, and scrape each book's details from Amazon


The scraper will then drop a csv file on my server with ALL the scraped details (from both sites)


In Summary, there are 3 steps:


# 1 visit website 'x', collect book details

#2 visit Amazon pages for each book and scrape Amazon book details

#3 create updateable csv file on server


THEN

Ensure WP All Import captures details/fields correctly (see below, most of this is already done)


IN DETAIL:

:

#1 Website 'x' does not have an RSS feed but updates daily.

= therefore, when the scraper/script visits the site it must make a record of ALL books scraped so it will ONLY scrape new books (today's books) each day.


NOTE: Website 'x' is a privately owned website, I do not want to hammer their resources or be flagged in any way, EG: Be respectful, I don't have permission to scrape it.


#2 The site is only a 'listing' of books, with links to book pages on Amazon, so the scraper must go to the site, collect links to Amazon, then visit those links and scrape each Amazon page.

+ scraper will need to visit multiple pages on the site


I have manually captured the required details from BOTH site 'x' and Amazon and created a sample csv file which you can use = all columns correctly labelled etc


I will provide EXACT details of what to scrape, with marked up screenshots, all communications will be very clear and easy to understand.


I don't mind what platform is used to build the scraper/submitter (php, python, etc) so long as it scrapes and submits.


WP All Import:


If you have experience with WP All Import, that is good, but probably not necessary. I have been able to figure out 90% of the necessary settings.


There are a couple of settings I cannot figure out, which maybe need a php function added, I would like you to check/edit the settings to ensure the import runs smoothly/correctly.


EG #1 I can't figure out how to match categories from csv to categories on my site!


EG #2: A custom field needs to display a date, and I don't know how to do that


EG#3: Some book titles need small text changes such as this title:

'Hunt for Justice Box Set: Books 1–2'

= IF a book title contains a colon ':' THEN remove the colon and ALL that follows it

So, title will now display as: 'Hunt for Justice Box Set'

ALSO

= IF a book title contains the words 'Box Set' THEN change 'Box' to 'Boxed' and surround words with brackets

So, title will now display as 'Hunt for Justice (Boxed Set)

ALSO

= If a title contains the words 'Omnibus', or 'Complete Omnibus', or 'The Complete Ominibus' THEN change to 'Omnibus' and surround with brackets

So, title will display as 'book title (Omnibus)

Budget: $30

Posted On: March 01, 2021 11:23 UTC
Category: Data Extraction
Skills:Web Scraper, PHP, SQL, Data Scraping

Skills: Web Scraper, PHP, SQL, Data Scraping
Country: United Kingdom

click to apply
Project ID: 3145997
Project category: Web Scraper, PHP, SQL, Data Scraping
Project budget: $30
View this project in detail (Note: you will be redirected to external marketplace)
Last Projects / Browse Projects
  Project Started
Figma to Next.js Conversion + API
Category: MongoDB, Next.js, Node.js, React.js, Tailwind CSS
Budget: ₹4999 - ₹5000 INR
16-May-2024
16:04 GMT
30-sec Appliance Repair Promo Video
Category: After Effects, Video Editing, Video Production, Video Services, Videography
Budget: $30 - $250 USD
16-May-2024
16:04 GMT
springboot refresh jwt
Category: HTML, Java, JavaScript, PHP, Software Architecture
Budget: $30 - $250 USD
16-May-2024
16:03 GMT
Childhood Obesity: Dietary Habit Causes
Category: Report Writing, Research Writing
Budget: min $50 USD
16-May-2024
16:03 GMT
Multi-Service Photo Editing on Photoshop
Category: Graphic Design, Illustration, Photo Editing, Photoshop, Photoshop Design
Budget: $2 - $8 USD
16-May-2024
16:02 GMT
Australian Small Business Email List Building
Category: Internet Marketing, Market Research, Marketing, Research, Sales
Budget: $15 - $25 AUD
16-May-2024
16:02 GMT
Enhance Site Visibility with On-Page SEO
Category: Link Building, SEO, Website Management, WordPress
Budget: $25 - $50 USD
16-May-2024
16:02 GMT
Développer mon site internet
Category: Facebook Marketing, Link Building, SEO
Budget: €250 - €750 EUR
16-May-2024
16:02 GMT
digital marketing
Category: Digital Marketing, Facebook Ads, Social Media Marketing
Budget: $250 - $750 USD
16-May-2024
16:00 GMT
US Medical Billing Project for Sale
Category: Copy Typing, Copywriting, Data Entry, Data Processing, Research
Budget: ₹500000 - ₹1000000 INR
16-May-2024
15:59 GMT
WEB - One Home Propiedades
Category: CSS, Graphic Design, HTML, PHP, Web Design
Budget: $30 - $250 USD
16-May-2024
15:58 GMT
Lead Generating WordPress Website for IT Service Provider
Category: Graphic Design, HTML, PHP, Web Design, WordPress
Budget: $30 - $250 CAD
16-May-2024
15:57 GMT
Multifaceted Photoshop Designer Needed -- 2
Category: Banner Design, Graphic Design, Logo Design, Photoshop, Photoshop Design
Budget: $10 - $30 USD
16-May-2024
15:57 GMT
Comprehensive Marketing Strategy Development
Category: Business Analysis, Business Plans, Internet Marketing, Market Research, Marketing
Budget: min $50000 USD
16-May-2024
15:57 GMT
Microsoft Office Customization & Setup Help
Category: Microsoft Access, Microsoft Exchange, Microsoft Office, Microsoft Outlook, Sharepoint
Budget: $15 - $25 USD
16-May-2024
15:56 GMT
Browse All Projects
Projects by Skills ...
Projects for 'android'
Projects for 'ajax'
Projects for 'asp'
Projects for 'aspnet'
Projects for 'cms'
Projects for 'cpp'
Projects for 'csharp'
Projects for 'css'
Projects for 'delphi'
Projects for 'design'
Projects for 'drupal'
Projects for 'excel'
Projects for 'facebook'
Projects for 'flash'
Projects for 'html'
Projects for 'java'
Projects for 'javascript'
Projects for 'joomla'
Projects for 'iphone'
Projects for 'mysql'
Projects for 'photoshop'
Projects for 'php'
Projects for 'python'
Projects for 'ruby'
Projects for 'seo'
Projects for 'sql'
Projects for 'sysadm'
Projects for 'translate'
Projects for 'typing'
Projects for 'twitter'
Projects for 'vbnet'
Projects for 'xml'
Projects for 'wordpress'
Projects for 'writing'
Read RSS feeds ... New!
RSS feed for 'android'
RSS feed for 'ajax'
RSS feed for 'asp'
RSS feed for 'aspnet'
RSS feed for 'cms'
RSS feed for 'cpp'
RSS feed for 'csharp'
RSS feed for 'css'
RSS feed for 'delphi'
RSS feed for 'design'
RSS feed for 'drupal'
RSS feed for 'excel'
RSS feed for 'facebook'
RSS feed for 'flash'
RSS feed for 'html'
RSS feed for 'java'
RSS feed for 'javascript'
RSS feed for 'joomla'
RSS feed for 'iphone'
RSS feed for 'mysql'
RSS feed for 'photoshop'
RSS feed for 'php'
RSS feed for 'python'
RSS feed for 'ruby'
RSS feed for 'seo'
RSS feed for 'sql'
RSS feed for 'sysadm'
RSS feed for 'translate'
RSS feed for 'typing'
RSS feed for 'twitter'
RSS feed for 'vbnet'
RSS feed for 'xml'
RSS feed for 'wordpress'
RSS feed for 'writing'
New!
Проекты на русском
(Projects in Russian)

Long URL:
www.1001freelanceprojects.com
Mobile version:
m.1001fp.com
Copyright © 2005-2022 1001 Freelance Projects