Topic
  • 2 replies
  • Latest Post - ‏2012-11-28T21:52:24Z by SystemAdmin
SystemAdmin
SystemAdmin
197 Posts

Pinned topic Set "forum thread creation date" as primary date for time series analysis

‏2012-11-28T13:29:18Z |
Hi,

I crawled a web forum using ICA and the Time Series only takes the date of crawling. I tried to search this forum, but was not succesful. How do I take the creation date of the forum thread as the primary date to be shown in the time series?
Updated on 2012-11-28T21:52:24Z at 2012-11-28T21:52:24Z by SystemAdmin
  • bfoyle
    bfoyle
    60 Posts

    Re: Set "forum thread creation date" as primary date for time series analysis

    ‏2012-11-28T18:17:20Z  
    This is standard web crawler behavior. Unless you can get to the database underlying the web forum, you won't be able to pull it in as a simple field.

    What you can do is use ICA Studio to create a normalized date annotator that looks for something like "Posted: Nov 28, 2012 08:29:18 AM" as is the form in this forum and then map that to a field in ICA as posted_date or something like that.

    bf
  • SystemAdmin
    SystemAdmin
    197 Posts

    Re: Set "forum thread creation date" as primary date for time series analysis

    ‏2012-11-28T21:52:24Z  
    • bfoyle
    • ‏2012-11-28T18:17:20Z
    This is standard web crawler behavior. Unless you can get to the database underlying the web forum, you won't be able to pull it in as a simple field.

    What you can do is use ICA Studio to create a normalized date annotator that looks for something like "Posted: Nov 28, 2012 08:29:18 AM" as is the form in this forum and then map that to a field in ICA as posted_date or something like that.

    bf
    Okay, thanks. I thought there might be some more elegant way as identifying the date from the HTML structure of a specific forum.
    But I will try the quick and dirty version by adding an annotator.