
Re: broken links on linuxdoc.org



GNU wget will do this for you.  Here are the options:

 wget --spider --force-html -i Howto-Howto.html

From the man page:
        --spider
              When invoked with this option, Wget will behave  as
              a  Web "spider", which means that it will not down-
              load the pages, just check  that  they  are  there.
              You can use it to check your bookmarks...
    
       -F --force-html
              When input is read from a  file,  force  it  to  be
              HTML.  This  enables you to retrieve relative links
              from existing HTML files on  your  local  disk,  by
              adding  <base href="URL"> to HTML, or using --base.     

        -i filename --input-file=filename
              Read  URL-s  from  filename, in which case no URL-s
              need to be on the command line. If there are  URL-s
              both  on  the command line and in a filename, those
              on the command line are first to be retrieved.  The
              filename  need not be an HTML document (but no harm
              if it is) - it is enough  if  the  URL-s  are  just
              listed sequentially.

              However,  if you specify --force-html, the document
              will be regarded as HTML. In that case you may have
              problems  with  relative links, which you can solve
              either by adding <base href="url"> to the  document
              or by specifying --base=url on the command-line. 
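
To check all of the HTML versions posted online in one go -- the script David
was asking about -- a shell loop around that same wget command ought to do it.
A rough sketch only (the path and log file name are made up, and the grep
pattern may need adjusting for your wget version):

 #!/bin/sh
 # Run a spider pass over each posted HTML file and log anything that
 # looks like a broken link.  The directory and log name below are
 # guesses -- adjust for the real layout on the server.
 for f in /var/www/linuxdoc/HOWTO/*.html; do
     echo "== $f ==" >> linkcheck.log
     wget --spider --force-html -i "$f" 2>&1 \
         | grep -i -e '404' -e 'failed' >> linkcheck.log
 done

Drop something like that into cron and the broken links should turn up in the
log before the users trip over them.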

I guess as long as the links are absolute it should be just fine :-)
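
And if some of them do turn out to be relative, passing --base on the command
line (as the man page describes) should take care of it.  The base URL below
is only a guess -- substitute whatever the links are actually relative to:

 wget --spider --force-html --base=http://www.linuxdoc.org/HOWTO/ \
      -i Howto-Howto.html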

Jesse 


On Thu, 02 Nov 2000, David Merrill wrote:
> Greg Ferguson wrote:
> > 
> > On Nov 1, 11:05pm, Gerald Oskoboiny wrote:
> > > Subject: broken links on linuxdoc.org
> > 
> > This has been corrected. If anyone finds or knows of other broken
> > links (or potential broken links) such as this, let us know so
> > we can put the redirects in place.
> 
> It wouldn't be too hard to write a script to verify URLs in the HTML
> versions posted online, right? I know I wrote such a script for my
> employer last year. Unfortunately, I no longer have it. :(
> 
> This would catch these types of errors before they become a problem for
> users.
> 
> -- 
> David C. Merrill, Ph.D.
> Linux Documentation Project
> Collection Editor & Coordinator
> www.LinuxDoc.org
> 
-- 
Got freedom?
http://www.debian.org


--  
To UNSUBSCRIBE, email to [email protected]
with a subject of "unsubscribe". Trouble? Contact [email protected]