Archive

Posts Tagged ‘web data’

Web Data Extractor 8.1 Search Algorithm Help

June 2nd, 2011 Comments off

I recently bought the full version of Web Data Extractor Version 8.1.
I am trying to use it to extract specific data from public (i.e. open access) sources.

I have not been able to get it to return the data I want yet.

I would like to find someone who is available ASAP, and from time to time, to help with this type of use of Web Data Extractor.

The ideal programmer has used WDE v8.1, can create search parameters (i.e. the application's search configuration), can test them to verify the extraction results, and can then advise me on the best approach to running WDE.

Ongoing work opportunities.

Web Data Entry

April 28th, 2011 Comments off

We are looking for someone to do web data entry for us. The job involves searching for and gathering specific data from certain sites, then entering the collected data into our CMS in the proper format. Step-by-step instructions will be given to the selected candidate.

http://simurl.com/kuzkos

Web Data Scraper

April 14th, 2011 Comments off

Hi,

We need a web-based data scraper that can scrape data from this site:

http://bit.ly/cSLvUP

We need this functionality

1. Enter a search term, e.g. Physiotherapy
2. Retrieve data, following all returned records (i.e. visit each)
3. Parse for duplicate phone numbers and return a single record for each phone number
4. Export to CSV/Excel
5. The minimum data retrieved needed is:
Company Name
Company Phone Number
Company Address (street location, suburb, state)
Company Website if available
Company Email if available
May need to consider web proxies
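Steps 3 and 4 above could be sketched roughly as follows. This is only an illustration using the Python standard library; the field names ("name", "phone", "address", "website", "email") and the function names are assumptions, not part of the posting, and the fetching and parsing of the site itself is left out.

```python
import csv
import io
import re

# Hypothetical column names for the minimum data listed in the posting.
FIELDS = ["name", "phone", "address", "website", "email"]


def normalize_phone(phone):
    """Keep digits only so '(02) 9999 0000' and '02-9999-0000' compare equal."""
    return re.sub(r"\D", "", phone or "")


def dedupe_by_phone(records):
    """Return the first record seen for each distinct phone number.

    Records without a phone number are kept, since they cannot be compared.
    """
    seen = set()
    unique = []
    for record in records:
        key = normalize_phone(record.get("phone"))
        if key:
            if key in seen:
                continue  # a record with this phone number was already kept
            seen.add(key)
        unique.append(record)
    return unique


def to_csv(records):
    """Serialize records to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    for record in records:
        writer.writerow({field: record.get(field, "") for field in FIELDS})
    return buffer.getvalue()
```

The resulting CSV text can be written to a file and opened directly in Excel.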

Thanks

Web Data Scraper

February 16th, 2010 Comments off

I need a scraper which will:

1. Login to a site with my username and password
2. Go to a specific ‘report’
3. Grab all the data from that report and store it in my database.
4. Allow me to view that data.

Summary of Data:
Data is in a table format and should be easily scrapable. It has column headers, which would be the same as the datatable field names.
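As a rough sketch of what "column headers become field names" could look like, the following uses only Python's standard html.parser module to turn one HTML table into a list of records keyed by the header row. It assumes a single, simple table with the headers in the first row; the class and function names are made up for illustration, and the login step is not covered.

```python
from html.parser import HTMLParser


class TableParser(HTMLParser):
    """Collect the text of every table row as a list of cell strings."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None and self._row is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


def table_to_records(html):
    """Use the first row as field names and return one dict per data row."""
    parser = TableParser()
    parser.feed(html)
    parser.close()
    if not parser.rows:
        return []
    header, *body = parser.rows
    return [dict(zip(header, row)) for row in body]
```

Each resulting dict maps a column header to a cell value, so it can be inserted into a database table whose field names match the headers.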

There are 2 pages to scrape resulting in 2 separate datatables of information.

First Page Data looks like this:
http://www.sensicorp.com/staging/scraper/ScrapeData1.png

Second Page Data looks like this:
http://www.sensicorp.com/staging/scraper/ScrapeData2.png

Since this is date-sensitive information and the site accepts date inputs, the system should be able to fetch older data for a given date range. This would be used to back-fill the database.

This should be pretty simple with Snoopy or similar.
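One way to approach the back-fill is to split the requested range into smaller windows and request the report once per window. The sketch below only generates the windows; the window size of 7 days and the function name are assumptions, and how the dates are submitted to the site is not shown.

```python
from datetime import date, timedelta


def date_windows(start, end, days_per_request=7):
    """Yield (window_start, window_end) pairs covering start..end inclusive."""
    if days_per_request < 1:
        raise ValueError("days_per_request must be at least 1")
    current = start
    while current <= end:
        window_end = min(current + timedelta(days=days_per_request - 1), end)
        yield current, window_end
        current = window_end + timedelta(days=1)
```

For example, `list(date_windows(date(2010, 1, 1), date(2010, 1, 10)))` gives two windows, 1 to 7 January and 8 to 10 January.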

Thank you.

Web Data Extraction

February 3rd, 2010 Comments off

I need to be able to extract web data from all kinds of web sources and am looking for someone that specializes in this.

Web Data Entry

January 12th, 2010 Comments off

The project is very simple: filling out a small amount of information on 170 websites.

All data will be provided.

Maximum time: 12 hours from acceptance.

Looking forward to your bids.

Serious bidders only, please; time wasters need not apply.

Scraping Web Data

December 30th, 2009 Comments off

Hi there,

I am looking for someone who can help scrape data from websites like Wikipedia or imdb.com.

I would like to put it into a database and then be able to have users access it through a website. I would like users to be able to sort the data by any of the column headers etc and to also be able to save out lists from the data.

I can negotiate price and if I like what you do the first time around, I would be willing to hire you for updates to the site.

Simple Web Data Script

December 8th, 2009 Comments off

Simple job of creating a script. The script needs to extract some data from one website (URL in PMB). Any of the below methods are preferred.

Only bid within budget.

This project was created last time for $15 but it has now stopped working

Web Data Input

December 2nd, 2009 Comments off

I work for a real estate company that uses many websites to market our properties. I am looking for someone to input 50 listings onto 2 websites. This includes copy/pasting details, inputting bed and bath counts, and uploading pictures.

Experienced bidders only please!!

Web Data Extraction

November 10th, 2009 Comments off

I am looking to extract web data for some categories. The number of records would be approximately 20,000. I am not looking for manual extraction or manual data entry. Interested providers should have automated extraction capability to bid. I am willing to pay 1.5 cents per record. Each record will have 6-8 columns. The data needs to be provided as a MySQL dump after proper cleaning of special characters, removal of duplicates, etc.

Simple Web Data Script

November 8th, 2009 Comments off

Simple job of creating a script. The script needs to extract some data from one website (URL in PMB). Any of the below methods are preferred.

Only bid within budget.

This project was created last time for $15 but it has now stopped working

Web Data Grabber

November 7th, 2009 Comments off

Hello,
I need to display the content of a page which changes on a daily basis.
I have the supergrabber2.2 script, which you can modify.
There are 3 pages which would have the same code with different URLs, and another site has 2 pages.
Please see PMB for URLs and more details.

A quick project.

Web Data Entry

October 22nd, 2009 Comments off

I need someone or a group to help enter links into a web admin, placing them on the site in the appropriate categories, in addition to entering keywords that are relevant to the site and some other tasks.

The user (or users, if you have more than one) would be given access to the two web admins and would take the data from one and enter it into the other, along with the additional information.

The web admin where the data is being entered has a crawl meta headers option that can be used in addition. There are about 400+ links or so to add. The user can add categories or subcategories to help with relevant info for the web directory data entry.

Web Data Extraction Of Product

October 16th, 2009 Comments off

We need someone to write a script that will get all product information from this site, http://www.r o l t a.co.uk/, and store each record in an Excel spreadsheet so that we can import it into Magento.

If you offer a good price and do a quick job, we have a few more extraction tasks and will keep employing you for this type of work.

We will require a sample of around 50-100 records so we can check and make sure you are on the right track.

This script will also need to download the images, store them in a folder, and add the relevant picture number to the Excel spreadsheet so we can easily import them.

Many thanks, and we look forward to your bids.
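The picture-numbering requirement could be handled along these lines. This is a sketch under assumptions: the "image_url" and "picture" keys, the five-digit numbering scheme, and the ".jpg" fallback extension are all hypothetical, and the actual download (for instance with urllib.request.urlretrieve) is left to the caller.

```python
import os
from urllib.parse import urlparse


def assign_picture_numbers(products, start=1):
    """Give each product image a numbered filename.

    Returns (rows, downloads): rows are copies of the products with a
    'picture' column added for the spreadsheet, and downloads is a list of
    (image_url, filename) pairs to fetch into the image folder.
    """
    rows = []
    downloads = []
    number = start
    for product in products:
        row = dict(product)
        url = product.get("image_url")
        if url:
            # Reuse the extension from the URL path, falling back to .jpg.
            extension = os.path.splitext(urlparse(url).path)[1] or ".jpg"
            filename = f"{number:05d}{extension}"
            downloads.append((url, filename))
            row["picture"] = filename
            number += 1
        else:
            row["picture"] = ""
        rows.append(row)
    return rows, downloads
```

Keeping the filename in the same row as the product data means the spreadsheet and the image folder stay in step during import.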

Web Data Extraction

October 16th, 2009 Comments off

I need to get records from http://www.studiofinder.com/advancedsearch.asp

Click ‘Any Genre’ in ‘Genre’ option.

You will get 14,469 results.

I need :

studio name
contact
email
web url
street address
city
state
country
rates

I need either the data or script.

Maximum budget is US$ 40.

No escrow** No escrow** No escrow**

Payment only after completion of project.

Project must be completed within 24 hours of accepting the task.

Web Data Extraction Lab 2

August 21st, 2009 Comments off

This is a Desktop Application
This will be written in Visual Basic .Net 2005
Microsoft Windows – NOT Vista or Windows 7

Your Experience needed:
Visual Basic .NET 2005 platform
MS Access Database programming
Expert Visual Basic GUI/form programmer.
Comfortable using TreeView and linked lists containing nodes.

This Project is for an MDI GUI application, whose main purpose is to Organize html based data extraction projects. It is a desktop windows application with explorer window forms, and Project configuration forms.

Please, no subcontract companies bid on this project. I have a Computer Science Degree, and want to work directly with the programmer, or the lead programmer, of the team.

Payment will be through Paypal only. On startup, I won’t advance any money.

I won’t pay anything until it is established that you can deliver something meaningful. The payments will be in sums along the way. You deliver 25% of the project, and I’ll pay 25% of the total, or I’ll pay for your time up to that point. If we agree, and you start the project, and you haven’t produced anything by the end of a month, don’t expect any payment. I will notify you, and the agreement will be null and void.

This is a feature driven project, and my requirements will change as the project evolves, and as I see new opportunities.

This is a feature driven project. I will let you know what I want delivered next, and that is what you work on next.

Your task will be to write a GUI interface. The main functionality you are creating is an Explorer system, which has data supplied by an MS Access database.

I have already completed the Processing Engine this application will use to load, and process the html data.

Web Data Scraping

August 18th, 2009 Comments off

I need to scrape a bunch of data for analysis purposes off of googlebase real estate listings. I will provide the search criteria to return the approximately 20K records needing to be written to excel or CSV. I don’t need the program, just the output file. PM me for additional information about what the scraping criteria and fields will contain.

Thanks,

Simple Web Data Script

June 30th, 2009 Comments off

Simple job of creating a script. The script needs to extract some data from one website (URL in PMB). Any of the below methods are preferred.

Only bid within budget.

This project was created last time for $15 but it has now stopped working.

Web Data Contact Lists

May 17th, 2009 Comments off

I need an Excel sheet of contact information from websites/portals on a list that I will provide. This contact info has to be accurate, with no duplications.

It needs to contain most if not all of the following:
Name of company, principal email address, URL, phone #, fax #, snail-mail address (with city, state, country, zip), description of company, and whether they have a company video (Yes or No).
Pay will be per contact, not per hour. Please provide a sample of previous work.

Go down each of my categories on my site.

Plug the keyword into the search box; for instance, go to Hoover (dot)com – business listings, and many others.

For any info that is missing, I need a Google search done to find it. No duplicates will be accepted.

Please show me the Excel spreadsheet when you bid so I know you understand the project as broken down in my format above. Please show me examples of prior work. Please bid on how many listings will be completed per $10.

Web Data Extraction

May 7th, 2009 Comments off

I am wanting to be able to extract data from an online horse racing database and output it on a regular basis to html and also output as a file to be used in an excel spreadsheet or similar.
This would appear to be a straightforward job for a team with good knowledge of spider scripts, web extraction and parsing of data.

See attachment for details.

Web Data Contact Lists

April 18th, 2009 Comments off

I need an Excel sheet of contact information from websites/portals on a list that I will provide. This contact info has to be accurate, with no duplications.

It needs to contain most if not all of the following:
Name of company, principal email address, URL, phone #, fax #, snail-mail address (with city, state, country, zip), description of company, and whether they have a company video (Yes or No).
Pay will be per contact, not per hour. Please provide a sample of previous work.

Go down each of my categories on my site.

Plug the keyword into the search box; for instance, go to Hoover (dot)com – business listings, and many others.

For any info that is missing, I need a Google search done to find it. No duplicates will be accepted.

Please show me the Excel spreadsheet when you bid so I know you understand the project as broken down in my format above. Please show me examples of prior work. Please bid on how many listings will be completed per $10.

Fetching Web Data Into Epesi

April 1st, 2009 Comments off

Need a module working under the EPESI custom platform. It will fetch/grab data on demand and on a daily basis from websites such as skapiec.pl, idealo.de, etc.
Data would have to be matched based on UPC/EAN, MPN, SKU, product name, etc., and displayed on a webpage with multi or single line items (if it cannot be done automatically, it should be done manually).
The main goal is to scrape over 100 websites, probably closer to 200. One scraper per website is fine as long as there is a configuration that will handle all the scrapers' settings, etc.

Websites should contain:

Deliverables:
1) Complete and fully-functional working program(s) as well as complete source code of all work done.
2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer’s environment–Deliverables must be installed by the Seller in ready-to-run condition in the Buyer’s environment.
b) For all others including software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.
3) All deliverables will be considered “work made for hire” under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder’s Seller Legal Agreement).
4) NDA

Requirements:
1. Has to work under EPESI platform located at http://sourceforge.net/projects/epesi
2. Common identification of products such as UPC/EAN, MPN, SKU. If no identification of product found it will be matched by Title.
3. Whenever possible use functions already used in the system, no overhead accepted
4. Products that are scraped (grabbed) from price search websites should be matched as per specification
5. Products need to have a price search action button next to them in the WMS module, as well as the ability to search using an autocomplete function.
6. New sites to scrape should be easy to add (no more than 2 hours of work for the admin to add a new site) and should utilize plugins
7. Sites that are already done have to be configurable in order for admin to make script corrections
8. Website categories should be matched with categories in the WMS and ecommerce modules, so that with a single click the desired products from a chosen category can be fetched. Action button in WMS module.
9. Multiple fetching hosts. Hosts will fetch data from multiple website locations and place it in the same DB. The master module will tell the slave modules how, when, and from which website to fetch the data, and which fetching host to use for which website. It should alternate hosts randomly and report any problems through the alerter in EPESI. Active hosts will redistribute the load if there are problems with fetching (something like load balancing).
10. Website categories should be indexed and saved for future fetching requests. This will have to be done periodically, and if any current settings have changed, an alert should be sent through EPESI so the admin can make changes.
11. Implement on-the-fly translation in order to match the categories and product info being fetched. Matching should be based on a sample of products in the category if the identification in point 2 of the requirements is not available.
12. Data should be stored in the database for easy retrieval as per EPESI project manager specification
14. Real-time exchange rates updates for different currencies
15. Number of entries to calculate average prices needs to be configurable
16. All displayed columns have to be sortable.
17. When the action is initialized:
a) It will collect data once a day as a whole system
b) As a category (list of products)
c) As a single product
18. Ability to add and remove columns (data)
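Requirement 9 (random alternation of fetching hosts, with active hosts taking over when others fail) could be sketched as below. This is an illustration only, written in Python rather than the PHP the platform uses; the function name, the host names in the usage note, and the idea of passing in a set of failed hosts are assumptions.

```python
import random


def assign_hosts(websites, hosts, failed=frozenset(), rng=None):
    """Pick a random active fetching host for each website.

    Hosts listed in `failed` are skipped, so their share of the load is
    spread over the remaining active hosts.
    """
    if rng is None:
        rng = random.Random()
    active = [host for host in hosts if host not in failed]
    if not active:
        raise RuntimeError("no active fetching hosts available")
    return {site: rng.choice(active) for site in websites}
```

Calling `assign_hosts(["idealo.de", "skapiec.pl"], ["host-a", "host-b"], failed={"host-b"})` would route both sites to host-a until host-b is reported healthy again.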

Pages should have:
1. Manufacturer, Model, Description, vendor, Category, Lowest Price from website (will calculate currency based on the default selected) and actual lowest price, show percentage of difference between the lowest value and the website value
a) Highlight in green the price that is the lowest and in red the one that is the highest.
b) Ability to click on the price to go to the particular website’s product page
c) Ability to remove vendors that have unreal prices from website or have very low ratings

2. Manufacturer, Model, Description, Category, Average Price of 5 (configurable) lowest entries from website (using the selected default currency), Average Price of 5 lowest entries using original currency, show percentage of difference between the lowest average of 5 and the highest website price
a) Highlight in green the price that is the lowest and in red the one that is the highest
b) Ability to click on the average price to go to the particular website’s products page
c) Ability to remove vendors that have unreal prices from website or have very low ratings

3. Product Name, Description, 5 websites with the lowest prices (default currency), price range of 5 lowest websites (default currency), price range of 5 highest websites (default currency), percentage between average of 5 lowest and 5 highest website prices (default currency)
a) Ability to click on price to go to the particular website’s products page
b) Ability to remove vendors that have unreal prices from website or have very low ratings
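The price figures described for these pages could be computed roughly as follows. The formula for "percentage of difference" is not defined in the posting, so the one used here (difference relative to the lower price) is an assumption, as are the function names; currency conversion is not shown.

```python
def average_of_lowest(prices, count=5):
    """Average of the `count` lowest prices (fewer if fewer are available)."""
    lowest = sorted(prices)[:count]
    if not lowest:
        raise ValueError("at least one price is required")
    return sum(lowest) / len(lowest)


def percent_difference(low, high):
    """Difference between two prices as a percentage of the lower one.

    Assumed definition; the posting leaves the exact formula open.
    """
    if low <= 0:
        raise ValueError("low price must be positive")
    return (high - low) / low * 100.0
```

The `count` argument corresponds to the configurable number of entries mentioned in requirement 15.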

4. Reporting module to Generate reports from the data stored in the database
a) The reporting module will have to work similar to crystal reports. I can create my own reports and the data would be populated on the website. Charting is not necessary but if it goes with reports it would be ok.
b) Products that have the highest percentage difference

It has to be easy to integrate into a website (modular design) and have an admin site to control the configuration.
All parameters used should be configurable as per EPESI module administration.
Additional websites (price search engines) should be easy to add. The data grabber website should run without locking up, and be fast and responsive.
The operation of the module has to be user friendly.

Platform:
EPESI, PHP, AJAX, JavaScript and MySQL DB

Additional questions.

UPC/EAN matching, MPN matching, SKU matching. If no product identification is found, the product will be matched by title.

Title matching should be automatic, based on the probability…
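The identifier matching described here could look something like the sketch below. The field names ("upc_ean", "mpn", "sku") and the order in which they are tried are assumptions taken from the order they are listed in; a result of (None, None) means the product falls through to title matching.

```python
# Hypothetical field names, tried in the order the posting lists them.
ID_FIELDS = ("upc_ean", "mpn", "sku")


def match_by_identifier(product, catalog):
    """Return (matching catalog entry, field that matched), or (None, None)."""
    for field in ID_FIELDS:
        value = product.get(field)
        if not value:
            continue  # the scraped product has no value for this identifier
        for candidate in catalog:
            if candidate.get(field) == value:
                return candidate, field
    return None, None
```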

1. If All words are present it would be 100% match.
2. If at least 2 words are matched and the rest is not it would be 75% match.
3. If 1 word is present it would be 25% match.
4. No match

The exact match percentages do not matter at this point; they would have to be worked out.
The basis for text matching is the title of the product.
Point 1. “Nikon D90” present everywhere would be 100% match
Point 2. “Nikon D90 body”, “Nikon D90 korpus”, “Nikon D90 kit” 75%
Point 3. “Nikon lens”, “Nikon flash”, “Nikon P80” 25%
Point 4. None would be left in the repository for matching or deletion.

All 100% matches would be done without manual intervention.
The 75% matches would be shown to the end user as the best suggestion, to be accepted or not. If not accepted, the product would have to be matched with the remaining products.
The 25% matches would be shown to the end user as the best suggestion. If not accepted, the product would have to be matched with the remaining products.
If a product is matched, the match should be remembered.
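The title-matching tiers above could be sketched as follows. This is one possible reading of the rules, checked against the Nikon examples: identical word sets score 100, two or more shared words with leftovers score 75, one shared word scores 25. The function names are made up, and the percentages are the placeholders from the posting, which says they still have to be worked out.

```python
def title_words(title):
    """Lower-cased set of whitespace-separated words in a title."""
    return set(title.lower().split())


def title_match_score(reference, candidate):
    """Score a candidate title against a reference title (0, 25, 75 or 100)."""
    ref = title_words(reference)
    cand = title_words(candidate)
    shared = ref & cand
    if ref and ref == cand:
        return 100  # all words present, nothing left over
    if len(shared) >= 2:
        return 75   # at least 2 words match, the rest do not
    if len(shared) == 1:
        return 25   # only 1 word matches
    return 0        # no match
```

A score of 100 would be applied automatically, while 75 and 25 would only be shown to the end user as suggestions.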

Example:

Initial project will have 3 websites:

1. www.pricegrabber.com
2. www.idealo.de
3. www.skapiec.pl

Here is a sample page for the Nikon D90 from which info should be pulled and matched.
I think the best way to grab and match the data is based on categories; here is a sample from the photo category:

http://cameras.pricegrabber.com/digital/Nikon-D90-Black-SLR-Digital-Body/m90725732.html/search=Nikon%20d90/st=product/sv=title
http://www.idealo.de/preisvergleich/OffersOfProduct/1124693_-d90-nikon.html

http://www.skapiec.pl/site/cat/2/comp/375159

Categories: Ajax, MySQL, PHP, Programming, SQL

Web Data Extraction

March 23rd, 2009 No comments

Hello, this is an easy web data extraction project. This needs to be done with a data mining/extraction script or software.

Please see PM for detailed information.

Note: I am looking for someone who can do this in an automated way, quickly and inexpensively.

My budget is $20 USD… First qualified bidder gets the project.

thx/dom
