<< Prev |
|
HTML
This is the plug gsfdm bound to use often. It donwloads url of types :Connection is - so far - anonymous, with no cookie support, so don't try yahoo's files like.
- "http:", https:".
Donwload use - so far - "wget".
You access options through the plug-in window.
Read next section for parsing options.
This plug is able to parse html files by finding references to url. These files are added to the list of jobs. It first looks whether this is really an html file ( token <html>...<body...>...</body>...</html> ) and after retreives all links ( token <A href="url" ...> ).
It doesn't check for others syntax errors.Actually there are support for :
- Static url : "http://server.com/dir/a_file"
- Relative url : "a_file.html" : files that refer to current document. The plug looks first for a <base> token. This is the reference to use. Else, for automatic parsing, the url of donwload file ( so the file to parse ) is use as reference. If no base token and parse from saved file, these url are discarded.
- Dynamic url : "../../other_dir/other_file" : same rules as relative. NOTA : doesn't check for syntax error, the plug neither can't parse url like "dir/../../other/file".
"Discard Local ... " : if it founds a reference to HD file, urls like "file:/..." or "/dir/...", doesn't add them to the list. "Discard html ..." : This discard files "*.html", "*.htm", "*.shtml", "*.shtm", "*.php", "*.cgi" from the list of files to add. NOTA : if switched on, with automatic parsing, these files are directly parsed. Beware loops...
"
<< Prev |
|