Issue 3342 - RFE logconv.pl should have a replacement in CLI tools
Bug description:
	Perl DB is not available in RHEL 10, so the server access log
	analyzer tool (logconv.pl) needs to be ported to Python.

Fix description:
	Initial draft of logconv.py; this is a work in progress. This
	commit message will be updated following code review/rework.

Fixes: 389ds#3342

Reviewed by:
jchapma committed Dec 2, 2024
1 parent fd62700 commit c114f06
Showing 1 changed file with 1,879 additions and 0 deletions.

2 comments on commit c114f06

@progier389


Hi James:

A few remarks about the general architecture:

  1. A huge function is difficult to read; IMHO you would be better off splitting the if key == / elif chain by using a table of functions.
  2. Splitting the parsing and the statistics computation allows reusing the statistics function when parsing a JSON completed-operation object.

So in short it would look like:

        'RESULT_REGEX': (re.compile(r'''
                           ...
                        ''', re.VERBOSE), process_result_line),
        # i.e. each value is now a tuple of (regex, function to process it)

pending_conns = {}   # A dict of: conn_id -> dict whose 'ops' member is a dict of
                     # op_id -> merged match groups with the same opid/connid
                     # (so that, when closing the connection, we can delete all
                     # ops associated with it with: del pending_conns[conn_id])



    def match_line(self, line, bytes_read):
        for key, (pattern, action) in self.regexes.items():
            match = pattern.match(line)
            if not match:
                continue
            try:
                groups = match.groupdict()
                # The datetime library doesn't support nanoseconds, so we need
                # to "normalize" the timestamp
                ...
                groups['norm_timestamp'] = norm_timestamp
                action(self, groups)
            except IndexError as exc:
                print(f'Access log line {line} is probably truncated: {exc}')
                return
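
The timestamp normalization elided above could, for instance, truncate nanoseconds to microseconds before handing the value to datetime. A sketch, assuming a 389-ds-style access log timestamp such as 28/Oct/2024:10:52:34.366429130 -0400 (the helper name and exact format are illustrative):

```python
from datetime import datetime

def normalize_timestamp(ts):
    # datetime.strptime's %f only accepts up to 6 fractional digits,
    # so truncate the 9-digit nanosecond field to microseconds.
    head, rest = ts.split('.', 1)
    frac, tz = rest.split(' ', 1)
    return datetime.strptime(f'{head}.{frac[:6]} {tz}',
                             '%d/%b/%Y:%H:%M:%S.%f %z')
```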

def process_result_line(self, groups):
        conn_id = groups['conn_id']
        op_id = groups['op_id']
        try:
            conn = pending_conns[conn_id]
            op = conn['ops'][op_id]
        except KeyError:
            # Operation is not present (probably around the start of the log file)
            return
        op.update(groups)
        process_result_statistics(op)
        del conn['ops'][op_id]

def process_search_line(self, groups):
        conn_id = groups['conn_id']
        op_id = groups['op_id']
        try:
            conn = pending_conns[conn_id]
        except KeyError:
            conn = { 'conn_id': conn_id, 'ops': {} }
            pending_conns[conn_id] = conn
        op = groups
        # remainder of search processing (updating op)
        conn['ops'][op_id] = op

def process_result_statistics(self, op):
        # This could be reused when finding a JSON object for a completed operation.
        # Then you have most of the code that currently follows
        #         if key == 'RESULT_REGEX':
        # (replacing all match.group('x') with op['x'])
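Taken together, the snippets above could be sketched as a small runnable toy. Note this is only an illustration of the dispatch-table idea, assuming simplified placeholder regexes and invented names (Parser, results) rather than the real logconv.py patterns:

```python
import re

class Parser:
    def __init__(self):
        # conn_id -> {'conn_id': ..., 'ops': {op_id: merged match groups}}
        self.pending_conns = {}
        self.results = []   # completed operations, ready for statistics
        # Table of (compiled regex, handler) pairs replaces the if/elif chain.
        self.regexes = {
            'SRCH_REGEX': (re.compile(
                r'conn=(?P<conn_id>\d+) op=(?P<op_id>\d+) SRCH base="(?P<base>[^"]*)"'),
                Parser.process_search_line),
            'RESULT_REGEX': (re.compile(
                r'conn=(?P<conn_id>\d+) op=(?P<op_id>\d+) RESULT err=(?P<err>\d+)'),
                Parser.process_result_line),
        }

    def match_line(self, line):
        for key, (pattern, action) in self.regexes.items():
            match = pattern.search(line)
            if match:
                action(self, match.groupdict())
                return

    def process_search_line(self, groups):
        # Create the connection entry on first sight, then record the op.
        conn = self.pending_conns.setdefault(
            groups['conn_id'], {'conn_id': groups['conn_id'], 'ops': {}})
        conn['ops'][groups['op_id']] = groups

    def process_result_line(self, groups):
        try:
            conn = self.pending_conns[groups['conn_id']]
            op = conn['ops'][groups['op_id']]
        except KeyError:
            return   # operation started before the beginning of the log
        op.update(groups)
        self.process_result_statistics(op)
        del conn['ops'][groups['op_id']]

    def process_result_statistics(self, op):
        # Parsing and statistics are kept separate, so this method can be
        # reused when reading a completed-operation JSON object directly.
        self.results.append(op)
```

Feeding it a SRCH line followed by the matching RESULT line leaves one merged operation dict (base, err, conn_id, op_id) in results, and the op is removed from pending_conns.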

Apart from that, it looks fine.
I like the use of heapq.

@jchapma
Owner Author

@jchapma jchapma commented on c114f06 Dec 9, 2024


Hi @progier389

I think this is an excellent idea.
