Mailinglist Archive: opensuse (6210 mails)
| < Previous | Next > |
Re: [SLE] wget confusion...
- From: Anders Norrbring <lists@xxxxxxxxxxxx>
- Date: Sun, 30 Oct 2005 11:08:27 +0100
- Message-id: <43649B9B.3030005@xxxxxxxxxxxx>
On 2005-10-30 09:48 Per Jessen wrote:
I just don't get it.. Could it be that the source site isn't static? That wget restarts somehow if there are new files somewhere or anything like that. I'm puzzled.
When I look in the wget log, I can see that it has looped files several times. Just as an example, I used the command above to mirror, then if I look in the directory everything is saved in, I have;
host.domain.tld/directory/ (and lots of files and dirs below this)
If I grep the log for 'directory/subdir/file.html' I have this:
--09:16:00-- http://host.domain.tld/directory/file.html.html
=> `host.domain.tld/directory/file.html.html'
Server file no newer than local file `host.domain.tld/directory/file.html.html' -- not retrieving.
=> `host.domain.tld/directory/file.html.html'
Server file no newer than local file `host.domain.tld/directory/file.html.html' -- not retrieving.
=> `host.domain.tld/directory/file.html.html'
Server file no newer than local file `host.domain.tld/directory/file.html.html' -- not retrieving.
=> `host.domain.tld/directory/file.html.html'
Server file no newer than local file `host.domain.tld/directory/file.html.html' -- not retrieving.
=> `host.domain.tld/directory/file.html.html'
Server file no newer than local file `host.domain.tld/directory/file.html.html' -- not retrieving.
=> `host.domain.tld/directory/file.html.html'
Server file no newer than local file `host.domain.tld/directory/file.html.html' -- not retrieving.
Seems like it has looped this specific file a couple of times.
--
Anders Norrbring
Norrbring Consulting
Anders Norrbring wrote:
I'm a bit confused by the behavior of wget... I've issued the command
wget -m -k -K -np http://host.domain.tld/directory/
on a box to backup a site completely, but after what I can see, wget
loops this indefinately, am I right?
I just tried the very same command on http://www.doubledecker.ch/ - no
indefinite looping.
I just don't get it.. Could it be that the source site isn't static? That wget restarts somehow if there are new files somewhere or anything like that. I'm puzzled.
When I look in the wget log, I can see that it has looped files several times. Just as an example, I used the command above to mirror, then if I look in the directory everything is saved in, I have;
host.domain.tld/directory/ (and lots of files and dirs below this)
If I grep the log for 'directory/subdir/file.html' I have this:
--09:16:00-- http://host.domain.tld/directory/file.html.html
=> `host.domain.tld/directory/file.html.html'
Server file no newer than local file `host.domain.tld/directory/file.html.html' -- not retrieving.
=> `host.domain.tld/directory/file.html.html'
Server file no newer than local file `host.domain.tld/directory/file.html.html' -- not retrieving.
=> `host.domain.tld/directory/file.html.html'
Server file no newer than local file `host.domain.tld/directory/file.html.html' -- not retrieving.
=> `host.domain.tld/directory/file.html.html'
Server file no newer than local file `host.domain.tld/directory/file.html.html' -- not retrieving.
=> `host.domain.tld/directory/file.html.html'
Server file no newer than local file `host.domain.tld/directory/file.html.html' -- not retrieving.
=> `host.domain.tld/directory/file.html.html'
Server file no newer than local file `host.domain.tld/directory/file.html.html' -- not retrieving.
Seems like it has looped this specific file a couple of times.
--
Anders Norrbring
Norrbring Consulting
| < Previous | Next > |