[Python] – Code to get unique values of some string attribute in a json.

#!/usr/bin/env python

import re


if __name__ == '__main__':
    lines = [line.rstrip('\n') for line in open('file.json')]
    ids = set()
    for line in lines:
        match_line = re.search('^.+\"attribute\":\"([a-z|A-Z|0-9|-]+)\".+$', line, re.IGNORECASE)
        if match_line is not None:
            ids.add(match_line.group(1))
    for id in ids:
        print(id)

The code will group by and print the unique string values of some attribute.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

w

Connecting to %s