-
Notifications
You must be signed in to change notification settings - Fork 2
/
Copy pathapiary.apib
312 lines (240 loc) · 11.3 KB
/
apiary.apib
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
FORMAT: 1A
HOST: https://research.democracy.ovh
# argument-analysis-research
A collection of code used for the research project at ARG-tech aiming for argument analysis and natural language processing
##### About
The purpose of this project is to provide tools and services used in our argument analysis research.
Parts of this research project will happen inside [Colaboratory](https://colab.research.google.com/drive/1iGL_J01I-SAtw2HG8uoJMLgYhYqMzzAK) as well. So this repository is a dependency there and should be treated accordingly.
## Argument Analysis API
Some parts of this repo get combined into a simple JSON API for use in other places (like Colaboratory).
This currently includes segmenting, keyword extraction as well as ADW based segment comparison.
### API Types
#### Document
Documents are the core ingestion format for the argument analysis API.
A document stores content (text) and depending of its state processed results like a list of segments or keywords.
```
{
"hash": "string", // a sha256 over the content field. usually autogenerated by the API
"content": "string", // the documents content. this is the single required field for every document
"segments": [] // array of segments as defined below
}
```
## Segmenting [/argument/segment]
### Single Input [POST /argument/segment]
You may supply a string of text to analyze and get a document containing segements.
+ Request (application/json)
{
"input": "Hello darkness my old friend"
}
+ Response 200 (application/json)
+ Body
{
"content": "Hello darkness my old friend",
"segments": [
{
"text": "Hello darkness my old friend"
}
]
}
### Bulk Input [POST /argument/segment/bulk]
You may supply a list of strings to analyze and get a list of documents each containing segements.
+ Request (application/json)
{
"inputs": [
"Hello darkness my old friend",
"Foo"
]
}
+ Response 200 (application/json)
+ Body
{
"documents": [
{
"content": "Hello darkness my old friend",
"segments": [
{
"text": "Hello darkness my old friend"
}
]
},
{
"content": "Foo",
"segments": [
{
"text": "Foo"
}
]
}
]
}
## Keyword Extraction [/argument/keyword]
For Keyword extraction we currently use RAKE. The endpoint adds found keywords to the passed in segment.
**TODO:** This endpoint requires some love as it is barely used at the moment and should take documents instead of basic input.
### Get a segment with keywords from basic input [POST /argument/keyword]
+ Request (application/json)
{
"input": "Hello darkness my old friend"
}
+ Response 200 (application/json)
+ Body
{
"text": "Hello darkness my old friend",
"keywords": [
{
"key": "old friend",
"value": 4
},
{
"key": "darkness",
"value": 1
}
]
}
## ADW API [/argument/adw]
This almost primitive service wraps around [ADW](https://github.com/pilehvar/ADW) to expose a very simple API for checking text similarity of a json request.
### Get distance for two strings [POST /argument/adw]
+ Request (application/json)
{
"text1": "We should stop eating meat",
"text2": "Don't buy so much stuff"
}
+ Response 200 (application/json)
+ Body
{
"result": 0.507
}
## Linker API [/argument/link]
The Linker API provides a service for checking a set of documents for links based on their segments.
It takes a pre segmented list of documents to find and return found links.
The service itself is async and acts like a database with background processing.
It provides an easy interface that manages multiple linker instances in the background.
If a new instance is created (by sending a linking POST without specifying any linkerID), the new ID gets returned by the service.
To add documents to this instance set the linkerID as shown below.
The linker compares all its stored documents based on their segments doing a two way rating through the rating service defined per linker instance.
### Insert one or many Documents into the Linker [POST /argument/link]
This request either sends to or creates a new linker which stores and handles the provided documents.
The `id` value is optional. If none is set, a new linker is being created and the output will contain the new ID for sending further documents or querying the results.
If `id` is set, the existing linker found under its value will get used.
The `rater` value is optional and defaults to `https://research.democracy.ovh/argument/adw`.
The `threshold` value is the minimal weight two segments need between eachother to get stored as links. It defaults to `0`.
Setting `rater` and `threshold` after the linker was created will have no impact on any existing linkers and get ignored if `id` is set.
+ Request (application/json)
{
"id": "",
"rater": "",
"threshold": 0.25,
"documents": [
{
"content": "Keep encouraging regulations reducing pollutants into our atmosphere.",
"segments": [
{
"text": "Keep encouraging regulations reducing pollutants into our atmosphere."
}
]
},
{
"content": "The eruption of two supervolcanoes would take care of it. At least for a while.",
"segments": [
{
"text": "The eruption of two supervolcanoes would take care of it."
},
{
"text": "At least for a while."
}
]
}
]
}
+ Response 200 (application/json)
+ Body
{"id": "8c2ab258-60dc-4416-9648-edd1efbf3a84"}
+ Response 500 (application/json)
+ Body
{
"error": "error message or object"
}
### List all existing linkers [GET /argument/linkers]
This request returns all the linkers persisted in the linker database.
+ Response 200
+ Body
{
"linkers": [
{
"metadata": {
"id": "c6b5dc9b-48fa-40d1-9d30-a134c045e851",
"rater": "https://research.democracy.ovh/argument/adw",
"threshold": 0.45
}
}
]
}
### List all links found so far [GET /argument/links{?id}{?docs}]
This request queries all documents and links for the provided linker `id`.
To not query all documents, the `docs` parameter can be set to false and the linker won't return them.
+ Parameters
+ id (string) - The linker id to query links from
+ docs (bool, optional) - If set to false, no documents will be returned, just links.
+ Default: `true`
+ Response 200 (application/json)
+ Body
{
"documents": [
{
"hash": "0d53f89084673365c99b79e8e0b0da359d38c341f6886e14dc3aa824080acc10",
"content": "If every human being on the planet dropped dead right now, the worst effects of climate change would still occur. There's no stopping it at this point. Anyone who tries to tell you differently is deluded.",
"segments": [
{
"text": "If every human being on the planet dropped dead right now, the worst effects of climate change would still occur."
},
{
"text": "There's no stopping it at this point."
},
{
"text": "Anyone who tries to tell you differently is deluded."
}
]
},
{
"hash": "af9eca75be519629fef2b63164714ba16cc03a2eee4e7966b2a4d61af563ea4b",
"content": "There is nothing we can do as individuals. No amount of energy saving light bulbs will remove CO2 from the atmosphere. There is nothing we can do as nations. Even human extinction would have no effect. The carbon is there. It's not going anywhere. There is no hope for reversing it. Now I'm sure lots of people who read this will consider me a pessimistic asshole with nothing of value to add to the conversation. Those people are delusional or misinformed and their hopes are based on nothing.",
"segments": [
{
"text": "There is nothing we can do as individuals."
},
{
"text": "No amount of energy saving light bulbs will remove CO2 from the atmosphere."
},
{
"text": "There is nothing we can do as nations."
},
{
"text": "Even human extinction would have no effect."
}
]
},
],
"links": [
{
"source": {
"doc": "af9eca75be519629fef2b63164714ba16cc03a2eee4e7966b2a4d61af563ea4b",
"seg": 3
},
"target": {
"doc": "0d53f89084673365c99b79e8e0b0da359d38c341f6886e14dc3aa824080acc10",
"seg": 0
},
"weight": 0.58415395
},
{
"source": {
"doc": "0d53f89084673365c99b79e8e0b0da359d38c341f6886e14dc3aa824080acc10",
"seg": 0
},
"target": {
"doc": "af9eca75be519629fef2b63164714ba16cc03a2eee4e7966b2a4d61af563ea4b",
"seg": 3
},
"weight": 0.58415395
}
]
}