This page (revision-1) was last changed on 29-Nov-2024 16:16 by UnknownAuthor

Only authorized users are allowed to rename pages.

Only authorized users are allowed to delete pages.

Page revision history

Version Date Modified Size Author Changes ... Change note

Page References

Incoming links Outgoing links

Version management

Difference between version and

At line 1 added 19 lines
!!! Overview
[{$pagename}] (or [Screen Scraping]) is a technique in which a computer program extracts [data] from human-readable output coming from another program without the use of an [Application Programming Interface] ([API]) or [Software Development Kit] ([SDK]).
[{$pagename}] is distinguished from regular parsing as the output being obtained was intended for display to an end-user, rather than as input to another program, and is therefore usually neither documented nor structured for convenient parsing.
[{$pagename}] often involves ignoring [binary] data (usually images or multimedia data), display formatting, redundant labels, superfluous commentary, and other information which is either irrelevant or hinders automated processing.
[{$pagename}] is subject to failure if the screen output changes.
[{$pagename}] is often part of a [Data Extraction Transformation] process.
!! [Web scraping]
[Web scraping] is [{$pagename}] used for extracting [data] from [websites].
!! More Information
There might be more information for this subject on one of the following:
[{ReferringPagesPlugin before='*' after='\n' }]
----
* [#1] - [Data_scraping|Wikipedia:Data_scraping|target='_blank'] - based on information obtained 2017-12-08