Abstract:
Systems and techniques are described for a path maximum transmission unit (MTU) discovery method that allows the sender of IP packets to discover the MTU of packets that it is sending over a conduit to a given destination. The MTU is the largest packet that can be sent through the network along a path without requiring fragmentation. The path MTU discovery method actively probes each sending path of each conduit with fragmentation enabled to determine a current MTU and accordingly increase or decrease the conduit MTU. The path MTU discovery process is resilient to errors and supports retransmission if packets are lost in the discovery process. The path MTU discovery process is dynamically adjusted at a periodic rate to adjust to varying network conditions.
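The probing idea above can be illustrated with a minimal sketch. It assumes a binary search over packet sizes and a caller-supplied `probe` function that reports whether a probe of a given size was acknowledged; the abstract names neither, so the search strategy, the function names, and the default bounds of 576 and 1500 bytes are all illustrative assumptions.

```python
from typing import Callable


def discover_path_mtu(
    probe: Callable[[int], bool],
    low: int = 576,      # assumed lower bound, not from the abstract
    high: int = 1500,    # assumed upper bound, not from the abstract
    retries: int = 3,
) -> int:
    """Return the largest size in [low, high] for which `probe` succeeds."""

    def probe_with_retry(size: int) -> bool:
        # Resend a probe up to `retries` times so that a single lost
        # packet is not mistaken for a size that is too large.
        return any(probe(size) for _ in range(retries))

    if not probe_with_retry(low):
        raise RuntimeError("path does not carry the minimum probe size")

    best = low
    lo, hi = low + 1, high
    while lo <= hi:
        mid = (lo + hi) // 2
        if probe_with_retry(mid):
            best = mid          # mid fits; try larger sizes
            lo = mid + 1
        else:
            hi = mid - 1        # mid did not fit; try smaller sizes
    return best
```

Running this function periodically per sending path, and raising or lowering the conduit MTU to the returned value, would match the periodic adjustment the abstract describes.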
Abstract:
A system and method for batch monitoring of performance of an information technology system are provided. The method includes configuring log collection, email and incidents of a batch job using a configuration manager; capturing a start of the batch job and an event log and streaming the captured data to an analytical engine; predicting an end time for the batch job and initiating the collection of a plurality of environment data, including the CPU and memory data, at a pre-determined frequency; increasing the frequency of the data collection, generating an alert to the concerned personnel of the client device, and subsequently initiating the troubleshooting procedure when a slowness in the data collection is detected by the analytical engine; completing the troubleshooting procedure by the analytical engine when an auto-correction mechanism is enabled; and generating a report for historical reference and storing the generated report in a database for further reference.
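As a rough illustration of the end-time prediction and the change in collection frequency, the sketch below assumes that the predicted end time is the start time plus the mean of past run durations, and that "slowness" means the job is still running after that predicted end. Both are assumptions made for the example, and the function names are hypothetical.

```python
from datetime import datetime, timedelta
from statistics import mean
from typing import List


def predict_end_time(start: datetime, past_durations: List[timedelta]) -> datetime:
    """Predict the end time as start plus the mean historical duration."""
    avg_seconds = mean(d.total_seconds() for d in past_durations)
    return start + timedelta(seconds=avg_seconds)


def sampling_interval(
    now: datetime,
    predicted_end: datetime,
    base: timedelta,
    fast: timedelta,
) -> timedelta:
    """Collect CPU and memory data more often once the job overruns."""
    # Assumed slowness rule: the job has passed its predicted end time.
    return fast if now > predicted_end else base
```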
Abstract:
An extraction method comprising: acquiring configuration information indicating a configuration of a processing device for each generation that represents a respective time period between configuration modifications, and extracting first information indicating configuration modifications between generations based on a plurality of generations of the acquired configuration information; acquiring performance information indicating various types of performance of the processing device at respective times, and extracting, as second information associated with one of the generations, a time when an abnormality was detected in a change to a time series of the acquired performance information; acquiring event information indicating an event occurring in the processing device at respective times, and extracting, as third information associated with one of the generations, a time of event information creation when an abnormality was detected; and extracting the first information, the second information, and the third information that have a relationship to each other.
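One way to picture the three kinds of information is to key them all by generation. The sketch below assumes that "a relationship to each other" means belonging to the same generation, and that anomaly and event times have already been detected upstream; the data shapes and function names are hypothetical.

```python
from bisect import bisect_right
from typing import Any, Dict, List


def generation_of(t: float, change_times: List[float]) -> int:
    """Generation 0 precedes the first change; generation g follows change g."""
    return bisect_right(change_times, t)


def related_information(
    configs: List[Dict[str, Any]],   # one configuration snapshot per generation
    change_times: List[float],       # sorted times of configuration changes
    anomaly_times: List[float],      # times of performance abnormalities
    event_times: List[float],        # creation times of abnormal events
) -> Dict[int, Dict[str, Any]]:
    related = {}
    for g in range(1, len(configs)):
        before, after = configs[g - 1], configs[g]
        # First information: settings that differ from the previous generation.
        diff = {
            key: (before.get(key), after.get(key))
            for key in set(before) | set(after)
            if before.get(key) != after.get(key)
        }
        # Second and third information: times that fall in this generation.
        anomalies = [t for t in anomaly_times if generation_of(t, change_times) == g]
        events = [t for t in event_times if generation_of(t, change_times) == g]
        if diff and anomalies and events:
            related[g] = {"changes": diff, "anomalies": anomalies, "events": events}
    return related
```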
Abstract:
Logging includes: obtaining an event designating message (EDM) comprising: a set of keys, or a type of designation corresponding to the set of keys, or both; looking up a set of values associated with the set of keys in a key-value store storing a plurality of keys and their corresponding values; constructing an event using the set of keys and values corresponding to the set of keys; and outputting the constructed event.
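The lookup-and-construct flow can be sketched as follows. The dictionary-shaped EDM, the `DESIGNATIONS` table that maps a designation type to its keys, and the JSON output format are assumptions for illustration; the abstract does not specify any of them.

```python
import json
from typing import Any, Dict, List

# Hypothetical table: a designation type expands to a set of keys.
DESIGNATIONS: Dict[str, List[str]] = {
    "request_summary": ["user_id", "request_path", "latency_ms"],
}


def resolve_keys(edm: Dict[str, Any]) -> List[str]:
    """Collect explicit keys plus any keys implied by the designation type."""
    keys = list(edm.get("keys", []))
    for key in DESIGNATIONS.get(edm.get("type"), []):
        if key not in keys:
            keys.append(key)
    return keys


def construct_event(edm: Dict[str, Any], store: Dict[str, Any]) -> Dict[str, Any]:
    """Look up each key in the key-value store and build the event."""
    return {key: store.get(key) for key in resolve_keys(edm)}


def output_event(event: Dict[str, Any]) -> str:
    """Serialize the constructed event; JSON is an assumed output format."""
    return json.dumps(event, sort_keys=True)
```

With this shape the message carries only keys or a type name, and the values are resolved from the store when the event is constructed.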
Abstract:
A computer-implemented method of reducing impact of performance data gathering on execution of instrumented code comprises gathering respective performance data for each of a plurality of routines in the instrumented code over a plurality of execution instances of the instrumented code. The method also comprises determining that the gathered performance data for at least one selected routine from the plurality of routines follows an identified pattern and, in response to determining that the at least one selected routine follows the identified pattern, removing instrumentation associated with the at least one selected routine.
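A small sketch of the idea, assuming the "identified pattern" is a run of stable durations: once the last few samples for a routine all lie within a tolerance of their mean, the routine is no longer measured. The class name, the window size, and the tolerance are hypothetical choices, not details from the abstract.

```python
from collections import defaultdict
from statistics import mean


class AdaptiveInstrumentation:
    """Stops recording a routine once its measurements look stable."""

    def __init__(self, window: int = 5, tolerance: float = 0.05):
        self.window = window          # samples needed to judge stability
        self.tolerance = tolerance    # allowed relative deviation from the mean
        self.samples = defaultdict(list)
        self.removed = set()

    def is_instrumented(self, routine: str) -> bool:
        return routine not in self.removed

    def record(self, routine: str, duration: float) -> None:
        if routine in self.removed:
            return  # instrumentation already removed; no overhead added
        history = self.samples[routine]
        history.append(duration)
        recent = history[-self.window:]
        if len(recent) < self.window:
            return
        avg = mean(recent)
        # Assumed pattern: every recent sample is close to the recent mean.
        if avg > 0 and all(abs(d - avg) <= self.tolerance * avg for d in recent):
            self.removed.add(routine)
```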
Abstract:
A system for identifying abnormal resource usage in a data center is provided. In some embodiments, the system employs a prediction model for each of a plurality of resources and an abnormal resource usage criterion. For each of a plurality of resources of the data center, the system retrieves current resource usage data for a current time and past resource usage data for that resource. The system then extracts features from the past resource usage data for that resource, predicts, using the prediction model for that resource, resource usage data for the current time based on the extracted features, and determines an error between the predicted resource usage data and the current resource usage data. After determining the error data for the resources, the system determines whether the errors satisfy the abnormal resource usage criterion. If so, the system indicates that an abnormal resource usage has occurred.
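The predict-then-compare loop can be sketched as below. The stand-in model (an average of the historical mean and the latest sample), the relative-error measure, and the criterion "at least `min_resources` errors exceed `threshold`" are all assumptions chosen to keep the example small; the abstract leaves the model and the criterion open.

```python
from statistics import mean
from typing import Dict, List, Tuple


def extract_features(past: List[float]) -> Dict[str, float]:
    """Summarize past usage; these two features are illustrative only."""
    return {"mean": mean(past), "last": past[-1]}


def predict(features: Dict[str, float]) -> float:
    """Stand-in prediction model, not the model from the abstract."""
    return 0.5 * features["mean"] + 0.5 * features["last"]


def abnormal_usage(
    current: Dict[str, float],
    past: Dict[str, List[float]],
    threshold: float = 0.5,
    min_resources: int = 1,
) -> Tuple[bool, Dict[str, float]]:
    errors = {}
    for name, history in past.items():
        predicted = predict(extract_features(history))
        # Relative error between predicted and observed usage.
        errors[name] = abs(current[name] - predicted) / max(abs(predicted), 1e-9)
    flagged = [name for name, err in errors.items() if err > threshold]
    # Assumed criterion: enough resources deviate strongly from prediction.
    return len(flagged) >= min_resources, errors
```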
Abstract:
Systems and methods for monitoring the performance associated with fulfilling resource requests and determining optimizations for improving such performance are provided. A processing device obtains and processes performance information associated with processing a request corresponding to two or more embedded resources. The processing device uses the processed performance information to determine a consolidation configuration to be associated with a subsequent request for the content associated with the two or more embedded resources. In some embodiments, in making such a determination, the processing device assesses performance information collected and associated with subsequent requests corresponding to the content associated with the two or more embedded resources and using each of a variety of alternative consolidation configurations. Aspects of systems and methods for generating recommendations to use a particular consolidation configuration to process a subsequent request corresponding to the content associated with the two or more embedded resources are also provided.
Abstract:
Embodiments for integrating production support features and recording client events are included in systems that initiate recording of the events occurring on a client application. The systems further receive event data transmitted from the client application, provide the event data on a display of a device of a user in real time, and allow the user to detect issues associated with the client application. The systems are combinable with additional production support features including module tracing.
Abstract:
A method, system and a computer program product for managing a computing system crash. Memory of the computing system is separated into at least two classifications, referred to herein as a dumpable area and a non-dumpable area. Upon detection of an operating system crash in the computing system, an operating system module prevents a dumping operation of the memory, including preventing access to the dumpable memory area, and divides the non-dumpable area into a new dumpable area and a new non-dumpable area. At such time as the operating system is rebooted, the dumping operation is initiated and completed in the dumpable area, and resumed operations use the non-dumpable area.
Abstract:
A hybrid cloud environment is provided where a user of cloud computing services is disposed to consume cloud provided services delivered by each of a plurality of cloud computing service providers. Specified event data is received at an MCS component. The event data pertains to metering events related to the cloud provided services delivered by one or more cloud service providers. The event data includes event data furnished by one or more entities that actively monitor metering events at one or more local or remote cloud service provider locations. The event data is sent from the MCS component to a metering system. The metering system generates reports from the event data that contain usage information on services provided by one or more cloud service providers. Information for a provider specifies amounts and quality of each type of service delivered to users by the provider.