Mailinglist Archive: opensuse (6210 mails)
| < Previous | Next > |
Re: [SLE] wget confusion...
- From: Randall R Schulz <rschulz@xxxxxxxxx>
- Date: Sun, 30 Oct 2005 07:26:08 -0700
- Message-id: <200510300626.09048.rschulz@xxxxxxxxx>
Anders,
On Saturday 29 October 2005 23:19, Anders Norrbring wrote:
> I'm a bit confused by the behavior of wget... I've issued the command
>
> wget -m -k -K -np http://host.domain.tld/directory/
>
> on a box to backup a site completely, but after what I can see, wget
> loops this indefinately, am I right?
Is it possible that the modification time returned is not that of the
underlying files, but rather the current time? I think this will
subvert wget's cycle-breaking logic, 'cause when it comes around to
download a given file a second time and it appears to have changed
(later mod time than the previous download), it will retrieve that file
again and the files it refers to again. The result: An infinite loop.
> My intention was that it should do one round, and then exit, but it
> seems like it just goes on...
Normally it works, as long as the graph formed by the hyperlinks is
bounded, either intrinsically or because you gave options that cut off,
say, links that go off-site or outside the hierarchy at which you
initiated the retrieval.
> --
>
> Anders Norrbring
Randall Schulz
On Saturday 29 October 2005 23:19, Anders Norrbring wrote:
> I'm a bit confused by the behavior of wget... I've issued the command
>
> wget -m -k -K -np http://host.domain.tld/directory/
>
> on a box to backup a site completely, but after what I can see, wget
> loops this indefinately, am I right?
Is it possible that the modification time returned is not that of the
underlying files, but rather the current time? I think this will
subvert wget's cycle-breaking logic, 'cause when it comes around to
download a given file a second time and it appears to have changed
(later mod time than the previous download), it will retrieve that file
again and the files it refers to again. The result: An infinite loop.
> My intention was that it should do one round, and then exit, but it
> seems like it just goes on...
Normally it works, as long as the graph formed by the hyperlinks is
bounded, either intrinsically or because you gave options that cut off,
say, links that go off-site or outside the hierarchy at which you
initiated the retrieval.
> --
>
> Anders Norrbring
Randall Schulz
| < Previous | Next > |