Thread: Parsing html ... (yes, i said it)

  
  1. #1
    modsyn is offline -Hacks Guru
    Join Date
    Aug 2005
    Location
    Shinigami Kurosaki Ichigo!
    Posts
    2,475
    Rep Power
    17

    hey everyone. i've been working on a project by myself for the last couple
    of days (or more). i'm trying to parse html using lua tables. anyway, i
    think i've got it working (somewhat) for single tags, but i'm running into
    problems with nested tags, and even the simplest of pages have multiple
    nested tags (e.g., <body> tags, tags, tags!! </body>).
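
    a rough sketch of one common way to handle nesting, for anyone following
    along: keep a stack of the tags that are currently open, attach each new
    node to whatever is on top of the stack, and pop when the closing tag shows
    up. this is Python using the standard html.parser module, not the Lua code
    from this project (which was removed). the names TreeBuilder, parse_html
    and VOID_TAGS and the node layout are made up for illustration, and the
    dicts stand in for Lua tables.

```python
from html.parser import HTMLParser

# Tags that never take a closing tag, so they are not pushed on the stack.
# (Partial list, chosen for this sketch.)
VOID_TAGS = {"br", "hr", "img", "input", "meta", "link"}


class TreeBuilder(HTMLParser):
    """Builds a nested tree; each node is {"tag", "attrs", "children"}."""

    def __init__(self):
        super().__init__()
        self.root = {"tag": "#root", "attrs": {}, "children": []}
        self.stack = [self.root]  # innermost open element is last

    def handle_starttag(self, tag, attrs):
        node = {"tag": tag, "attrs": dict(attrs), "children": []}
        self.stack[-1]["children"].append(node)
        if tag not in VOID_TAGS:
            self.stack.append(node)

    def handle_endtag(self, tag):
        # Pop back to the nearest matching open tag; ignore stray closers.
        for i in range(len(self.stack) - 1, 0, -1):
            if self.stack[i]["tag"] == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.stack[-1]["children"].append(text)


def parse_html(text):
    """Parse an HTML string and return the root node of the tree."""
    builder = TreeBuilder()
    builder.feed(text)
    builder.close()
    return builder.root
```

    calling parse_html on a small page gives back a root node whose children
    list holds the <body> node, whose children hold the nested tags, and so on
    down. text ends up as plain strings inside the children lists.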

    --code was removed because it's already changed beyond recognition...

    i was hoping to get some help later on in this project, especially when it
    comes to the display. i've been doing backend stuff with net connections
    and trying this parsing, but it will take a lot of work to write all the code
    for displaying the different types of tags. so, i plan on doing the major
    ones first (like <a>, <img>, etc.) and just adding in as many of the
    rest of them as time allows.
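
    one way to structure "major tags first, the rest as time allows" is a
    lookup table from tag name to a display function, with a fallback that just
    shows the text inside any tag that has no handler yet. the sketch below is
    Python, not this project's Lua. RENDERERS, render, render_link and
    render_image are hypothetical names, and the node layout (tag, attrs,
    children) is an assumption made for the example.

```python
def render_link(node, inner):
    """Show link text followed by its target in brackets."""
    return f"{inner} [{node['attrs'].get('href', '')}]"


def render_image(node, inner):
    """Show a text placeholder for an image (alt text if present, else src)."""
    attrs = node["attrs"]
    return f"[image: {attrs.get('alt') or attrs.get('src', '')}]"


# Only the "major" tags have handlers; add more entries as time allows.
RENDERERS = {"a": render_link, "img": render_image}


def render(node):
    """Render a node (or a plain text string) as text."""
    if isinstance(node, str):
        return node
    inner = " ".join(render(child) for child in node["children"])
    handler = RENDERERS.get(node["tag"])
    # Tags without a handler fall back to showing their contents only.
    return handler(node, inner) if handler else inner
```

    supporting another tag then means writing one more function and adding one
    more entry to the table; nothing else has to change.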

    maybe i'm shooting for the moon here... if anyone would like to be a part
    of this or would like to give me more reasons to scrap the project, just
    let me know. i'd really like to have some kind of a web browser that
    allows for downloads on 1.5 even if it's text-based and buggy...

    '??'

    ps, that code is really sloppy and is in dire need of saving.

    jMEnc Guide, jMEnc2 page - by the way, you smell nice

  2. #2
    LordCthulu is offline Senior Member -Hacks Enthusiast
    Join Date
    Mar 2005
    Posts
    578
    Rep Power
    15

    Quote Originally Posted by modsyn
    if anyone would like a copy of these libs let me know and
    i'll post links to the current version of them.

    Sure, I'd like to check them out. You have networking libraries?
    I'd offer my help if only I knew how normal browsers parse HTML.

    Oldest psp-hacks member ever.

  3. #3
    modsyn is offline -Hacks Guru

    ok, i just got the download speed to a comfortable place. i tested it against
    someone using the 2.0 browser and we were about even (i actually won).

    here's the link to the collection.

    as far as the html parsing goes, i'm making some progress and the above
    code is now irrelevant. so, i'll edit my post accordingly.

    if anyone has good functions that they would like to be added to the collection
    just post them here or PM me and i'll get them in there.

    :)

  4. #4
    illfoundedmind is offline -Hacks Enthusiast
    Join Date
    Nov 2005
    Location
    WTF~~~~~~~~~~> Rank: %NULL
    Posts
    389
    Rep Power
    14

    Bitch work... nah all set, good luck though :8)
    july 19

  5. #5
    modsyn is offline -Hacks Guru

    it's cool. i guess since nobody wants the fame of working on the browser
    project that it'll be all mine... j/p

  6. #6
    illfoundedmind is offline -Hacks Enthusiast

    Damn, I wish I could help out. Hell, I'd be stupid not to take this opportunity to work alongside the great modsyn :8) (okay, so I just want to see this app get written)

  7. #7
    modsyn is offline -Hacks Guru

    i just decided to scrap the 'browser' project and incorporate some of its
    features into linker. everyone who said it was too much work was right.

  8. #8
    romero126 is offline -Hacks Neophyte
    Join Date
    Jan 2006
    Posts
    81
    Rep Power
    14

    Quote Originally Posted by modsyn
    i just decided to scrap the 'browser' project and incorporate some of its
    features into linker. everyone who said it was too much work was right.
    Good idea. Trying to make a full-blown browser in a scripting language is bitch work. Not to mention you will never get the results you want from it: once you start working with multiple elements, your HTML parser/downloader begins to lag behind and generally slows to a grinding halt.
