IDENTIFYING EQUIVALENT JAVASCRIPT EVENTS

    公开(公告)号:CA2786418C

    公开(公告)日:2020-04-14

    申请号:CA2786418

    申请日:2012-08-16

    Abstract: An illustrative embodiment of a computer-implemented process for identifying equivalent JavaScript events receives source code containing two JavaScript events for equivalency analysis, extracts an HTML element containing an event from each JavaScript event and analyzes the extracted HTML elements. Responsive to a determination that the HTML elements are of a same type according to equivalency criteria B, and responsive to a determination that the HTML elements have a same number of attributes according to equivalency criteria C, determines whether JavaScript function calls of each JavaScript event are similar according to equivalency criteria A. Responsive to a determination that the JavaScript function calls are similar according to equivalency criteria A, and responsive to a determination that the other attributes of the HTML elements satisfy equivalency criteria D, identifies the JavaScript events as equivalent.

    IDENTIFICATION OF SEQUENTIAL BROWSING OPERATIONS

    公开(公告)号:CA2789936C

    公开(公告)日:2020-02-18

    申请号:CA2789936

    申请日:2012-09-14

    Abstract: An illustrative embodiment of a computer-implemented process for identifying sequential browsing operations receives session data associated with a plurality of sessions, creates a reduced page for each page in a series of pages associated with a first session in the plurality of sessions and creates a hash value associated with each reduced page for each page in the series of pages associated with the first session of the plurality of sessions. Responsive to a determination that the hash value of the first session is equivalent to the hash value of the second session, the computer-implemented process identifies an associated page as an equivalent page and merges equivalent pages to create a common sequence without a need to resend requests associated with the session data to a server.

    IDENTIFYING EQUIVALENT LINKS ON A PAGE

    公开(公告)号:CA3120833A1

    公开(公告)日:2013-12-26

    申请号:CA3120833

    申请日:2012-06-26

    Abstract: A computer-implemented process for identifying equivalent links on a page responsive to a determination that the crawler has not visited all required universal resource locators, locates a next URL to be crawled to form a current URL and processes the current URL to identify equivalent URLs. Responsive to a determination that the crawler has not visited the current URL, determine whether necessary to crawl all identified equivalent URLs and responsive to a determination that it is necessary to crawl all identified equivalent URLs, adding all equivalent URLs to a list of URLs to be crawled.

    IDENTIFYING CLIENT STATES
    4.
    发明专利

    公开(公告)号:CA2816781C

    公开(公告)日:2022-07-05

    申请号:CA2816781

    申请日:2013-05-28

    Abstract: An illustrative embodiment of a method for identifying client states, receives a set of paths representative of a document object model (DOM) associated with a web page of a rich Internet application and for each path in the set of paths received, extracts a subtree, as Subtree X, for a current path. The method traverses all known sub-paths under the current path and delete corresponding subtrees from Subtree X and reads contents of and determines states of Subtree X to form a State X. The State X is added to a set of current states and responsive to a determination no more paths exist, returns the set of current states of the rich Internet application.

    IDENTIFYING UNVISITED PORTIONS OF VISITED INFORMATION

    公开(公告)号:CA2779235C

    公开(公告)日:2019-05-07

    申请号:CA2779235

    申请日:2012-06-06

    Abstract: An illustrative embodiment for identifying unvisited portions of visited information to visit, receives information to crawl, wherein the information is representative of one of web based information and non-web based information, computes a locality sensitive hash (LSH) value for the received information and identifies a most similar information visited thus far. The illustrative embodiment determines whether the LSH of the received information is equivalent to most similar information visited thus far and responsive to a determination that the LSH of the received information is not equivalent to most similar information visited thus far, identifies a visited portion of the received information using information for most similar information visited thus far and crawls only unvisited portions of the received information.

    IDENTIFYING SESSION IDENTIFICATION INFORMATION

    公开(公告)号:CA2738289C

    公开(公告)日:2018-05-29

    申请号:CA2738289

    申请日:2011-04-28

    Abstract: A computer-implemented process, computer program product, and apparatus for identifying session identification information. A recording is initiated and an operation sequence of interest is performed while recording and the recording ceases. Responsive to a determination that the operation sequence of interest was successful, information from the operation sequence of interest is saved as recorded information and responsive to a determination that a same operation sequence of interest was recorded, the recorded information from each operation sequence of interest is compared. Differences in the recorded information are identified to form identified differences and a session identifier is constructed using the identified differences.

    SYNCHRONIZING HTTP REQUESTS WITH RESPECTIVE HTML CONTEXT

    公开(公告)号:CA2789909C

    公开(公告)日:2019-09-10

    申请号:CA2789909

    申请日:2012-09-14

    Abstract: An illustrative embodiment of a computer-implemented process for synchronizing requests with a respective context, responsive to a determination that there are more pages to explore, performs regular crawling operations for a current page, records a current page in a list of explored pages and extract links from the current page. Responsive to a determination that there are more links to extract, selects a next link to analyze to form a selected link and responsive to a determination that there is a new request associated with the selected link, creates a new request identifier and saves an entry in a hashmap. Responsive to a determination that there is not a new request associated with selected link, updates a request associated with the selected link with a new link value when the link value differs.

    TRACKING JAVASCRIPT ACTIONS
    8.
    发明专利

    公开(公告)号:CA2838911C

    公开(公告)日:2020-05-05

    申请号:CA2838911

    申请日:2014-01-09

    Abstract: An illustrative embodiment of a computer-implemented process for tracking JavaScript actions in a rich Internet application, receives a document object model (DOM) representative of a particular page of an application at a particular time and analyzes the DOM received to identify each JavaScript action on the particular page for which for each JavaScript action identified, a JavaScript action characteristics ID is calculated and stored. Responsive to a determination multiple instances of a same ID exist, collecting a list of JavaScript actions corresponding to each ID corresponding to a multiple JavaScript action and removing from memory JavaScript action entries for the multiple instances of the same ID. A neighbor influence is computed for a member of the list of JavaScript actions remaining and the JavaScript action ID calculated for the member of the list of JavaScript actions remaining is stored. Responsive to a determination there are no more multiple JavaScript actions, return all JavaScript action IDs stored.

    CRAWLING RICH INTERNET APPLICATIONS

    公开(公告)号:CA2790379C

    公开(公告)日:2020-02-25

    申请号:CA2790379

    申请日:2012-09-20

    Abstract: An illustrative embodiment of a computer-implemented process for crawling rich Internet applications executes sets of events discovered in a state exploration phase according to a predetermined priority of each set of events in the sets of events discovered, wherein events from a higher priority are exhausted before an event from a lower priority is executed and responsive to a determination that transitions remain, executes a set of events in a transition exploration phase. The computer-implemented process further determines whether a new state exists as a result of executing an event in the set of events and responsive to a determination that a new state exists, returning to the state exploration phase.

Patent Agency Ranking