php to spider a website

Ask a Question related to PHP Development, Design and Development.

  1. #1

    Default php to spider a website

    I am looking for a script that I can use to spider a website, and then pull
    the images... I know how to do it for a single page, but, I would like to be
    able to do this for the entire site. Any suggestions?

    Thanks,
    Kyle Mizell
    [url]http://www.pimpinonline.com[/url]


    Kyle Mizell Guest

  2. Similar Questions and Discussions

    1. Verity Spider license
      Who do I purchase an external license for the verity spider from? Coldfusion or Verity? Thanks Ben
    2. Web Spider Component
      Does anyone know of a good web spidering component available to crawl and extract llinks in coldfusion?
    3. Question regarding Verity Spider
      I am using Verity Spider to index pages of one of our websites, and everything seems to be going ok. However, if you type in a search keyword in...
    4. php web spider
      Hi I have a device (a wireless access point) wich is configurable using its internal web server. I did a little php script that opens a socket to...
    5. Spider Web Effect
      Hi All I am trying to create a spider web effect and am not having much luck... any ideas. It can be lieteral or stylised as long as it suggest a...
  3. #2

    Default Re: php to spider a website

    "Kyle Mizell" <kyle@pimpinonline.comNOSPAM> wrote in message
    news:qewyb.174752$Dw6.686810@attbi_s02...
    > I am looking for a script that I can use to spider a website, and then
    pull
    > the images... I know how to do it for a single page, but, I would like to
    be
    > able to do this for the entire site. Any suggestions?
    >
    > Thanks,
    > Kyle Mizell
    > [url]http://www.pimpinonline.com[/url]
    >
    >
    >
    I don't know about your question, but pimpinonline.com is awesome.


    jn Guest

  4. #3

    Default Re: php to spider a website

    Kyle Mizell wrote:
    > I am looking for a script that I can use to spider a website, and then pull
    > the images... I know how to do it for a single page, but, I would like to be
    > able to do this for the entire site. Any suggestions?
    Why php? Use wget if all you want is a somple spider job.

    Kevin Thorpe Guest

  5. #4

    Default Re: php to spider a website

    On Mon, 01 Dec 2003 00:49:26 GMT, "Kyle Mizell" <kyle@pimpinonline.comNOSPAM>
    wrote:
    >I am looking for a script that I can use to spider a website, and then pull
    >the images... I know how to do it for a single page, but, I would like to be
    >able to do this for the entire site. Any suggestions?
    PHP has HTTP client functions; you can simply use file() with a URL.

    However, to extract information from the HTML, you need an HTML parser
    (regular expressions alone are not sufficient). PHP doesn't have one built in
    or as one of the standard extensions. Personally I'd use Perl for this (e.g.
    HTML::Parser). I think there is an HTML parser for PHP called HTML-Sax, have a
    search for that.

    --
    Andy Hassall (andy@andyh.co.uk) icq(5747695) ([url]http://www.andyh.co.uk[/url])
    Space: disk usage analysis tool ([url]http://www.andyhsoftware.co.uk/space[/url])
    Andy Hassall Guest

  6. #5

    Default Re: php to spider a website

    "Kyle Mizell" <kyle@pimpinonline.comNOSPAM> wrote in message news:<qewyb.174752$Dw6.686810@attbi_s02>...
    > I am looking for a script that I can use to spider a website, and then pull
    > the images... I know how to do it for a single page, but, I would like to be
    > able to do this for the entire site. Any suggestions?
    >
    > Thanks,
    > Kyle Mizell
    > [url]http://www.pimpinonline.com[/url]
    As you do for one page do for all your pages.
    In one array store all links foud on first page (eliminate
    duplicates), then do for all this pages as for first page.
    I think the beset is to make function, which save one page and return
    found links, then call your function with all urls.
    While you are saving a page you have to replace links because static
    names will be diferent
    i.e.
    members.php?search_sex=Male&search=kyle@pimpinonli ne.com&unset_search=true
    replace with
    members_php_search_sex_Male_search_kyle_pimpinonli ne_com_unset_search_true.HTML

    and so name all stored pages.

    enjoy
    lazo Guest

Posting Permissions

  • You may not post new threads
  • You may post replies
  • You may not post attachments
  • You may not edit your posts

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139