This repository has been archived by the owner on May 12, 2024. It is now read-only.

Interests divorced from benefactor #9

Open
psychemedia opened this issue Sep 28, 2022 · 0 comments

Comments

psychemedia commented Sep 28, 2022

Many of the interests listed are divorced from the party that provided the benefit (example query).

For example:

| item | date | person_id | name |
| --- | --- | --- | --- |
| On 30 January 2013 I was appointed as a non-executive director of the Social Investment Business Group, Address 1st Floor, Derbyshire House, St Chad's Street, London WC1H 8AG. | 2013-11-25 | uk.org.publicwhip/person/10051 | Directorships |
| 27 February 2013, received £364.11. Hours: 12 hrs. (Registered 13 March 2013) | 2013-11-25 | uk.org.publicwhip/person/10051 | Directorships |
| 27 March 2013, received £333.34. Hours: 12 hours (estimated). (Registered 3 June 2013) | 2013-11-25 | uk.org.publicwhip/person/10051 | Directorships |
| 26 April 2013 received £333.34. Hours: 12 hours (estimated). (Registered 3 June 2013) | | | |
| -- | -- | -- | -- |

There are multiple payments, presumably all from the first stated organisation. That assumes we can trust the sort order to re-present the original grouped items in the correct sequence. Or is there a better sort strategy?
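
If the original order can be trusted, one possible way to re-attach each payment to its benefactor is to carry the most recent non-payment item forward within the same `person_id` and `name` (category). This is only a sketch: the function name `attach_benefactor`, the `parent_item` key and the "received £..." heuristic for spotting payment lines are my own assumptions, not anything the dataset provides.

```python
import re

# Assumption: a payment line contains "received £<amount>".
PAYMENT_RE = re.compile(r"\breceived\s+£[\d,]+(?:\.\d+)?", re.IGNORECASE)


def attach_benefactor(rows):
    """Carry the last non-payment item forward onto later payment rows.

    `rows` is a list of dicts with "item", "person_id" and "name" keys,
    assumed to be in the register's original order.
    """
    out = []
    current = {}  # (person_id, name) -> most recent non-payment item
    for row in rows:
        key = (row["person_id"], row["name"])
        if PAYMENT_RE.search(row["item"]):
            parent = current.get(key)  # None if no earlier entry was seen
        else:
            current[key] = row["item"]
            parent = None
        out.append({**row, "parent_item": parent})
    return out


PERSON = "uk.org.publicwhip/person/10051"
rows = [
    {"item": "On 30 January 2013 I was appointed as a non-executive director "
             "of the Social Investment Business Group, Address 1st Floor, "
             "Derbyshire House, St Chad's Street, London WC1H 8AG.",
     "person_id": PERSON, "name": "Directorships"},
    {"item": "27 February 2013, received £364.11. Hours: 12 hrs. "
             "(Registered 13 March 2013)",
     "person_id": PERSON, "name": "Directorships"},
    {"item": "27 March 2013, received £333.34. Hours: 12 hours (estimated).  "
             "(Registered 3 June 2013)",
     "person_id": PERSON, "name": "Directorships"},
]

result = attach_benefactor(rows)
for r in result:
    print(r["item"][:30], "->", (r["parent_item"] or "")[:30])
```

This only works if row order is preserved end to end, so an explicit sequence number captured at scrape time would probably be a safer sort key than relying on the query's default ordering.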

I also note:

  • there may be duplicate entries?
  • it would be useful to pull out the (Registered DATE) element;
  • entity extraction (e.g. spaCy) could be used to extract companies, addresses, amounts, etc.
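
As a lighter-weight starting point than full entity extraction, the first two points could be sketched with regular expressions. The patterns below are guesses based only on the three example rows above, and `parse_item` and `find_duplicates` are hypothetical helper names; real entries will likely need more variants.

```python
import re
from collections import Counter
from datetime import datetime

# Patterns inferred from the example rows only (assumptions).
REGISTERED_RE = re.compile(r"\(Registered\s+(\d{1,2}\s+[A-Za-z]+\s+\d{4})\)")
AMOUNT_RE = re.compile(r"£([\d,]+(?:\.\d+)?)")
HOURS_RE = re.compile(r"Hours:\s*(\d+(?:\.\d+)?)\s*(?:hrs?|hours?)", re.IGNORECASE)


def parse_item(text):
    """Pull the registered date, first £ amount and hours out of an item."""
    reg = REGISTERED_RE.search(text)
    amt = AMOUNT_RE.search(text)
    hrs = HOURS_RE.search(text)
    return {
        # "%B" expects English month names under the default C locale.
        "registered": datetime.strptime(reg.group(1), "%d %B %Y").date() if reg else None,
        "amount_gbp": float(amt.group(1).replace(",", "")) if amt else None,
        "hours": float(hrs.group(1)) if hrs else None,
    }


def find_duplicates(rows):
    """Return (person_id, item) pairs seen more than once, ignoring whitespace runs."""
    counts = Counter((r["person_id"], " ".join(r["item"].split())) for r in rows)
    return [key for key, n in counts.items() if n > 1]


example = "27 February 2013, received £364.11. Hours: 12 hrs. (Registered 13 March 2013)"
parsed = parse_item(example)
print(parsed)
```

Normalising whitespace before comparing matters here because some items contain double spaces (for example before "(Registered"), which would otherwise hide exact duplicates.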