-
Notifications
You must be signed in to change notification settings - Fork 111
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Switch from using String
to using struct
for messages between server and clients
#48
Comments
|
+1 - I suggest to use HTONS & NTOHS in this case as we already do in TCP... Also maybe we do need packed structure here... |
Also, I assume this is part of the same packet structured described here https://github.com/sudomesh/disaster-radio/wiki/Protocol#packet-structure but it doesn't quite match (only 1 byte sequence number, no delimiter) |
Not that I can think of, I think just used uint8_t because it felt more explicit that it was two bytes. And it kept everything in the same type.
I would keep this as uint8_t because, while it is typically a char, it does not, and should not have to always be a char.
The delimiter is a hold-over from the initial design of the chat app. I believe it makes the messages easier to parse in javascript, maybe? I'm in favor of dropping the delimiter and making the messages entirely binary, since I believe that would be the most efficient use of the limited message size.
The data is not necessarily null terminated. I would say that data format definitely depends on the type. This means we would need a buffer+length, if I am not mistaken. Ultimately, I'm not really tied to the original message structure, it is just what I inherited from the chat app design. If I wanted to re-imagine this in the context of the refactored code, I would suggest something like this,
In this new struct, both type and data_length are appended by the transmitting client and should be removed by LoRaLayer2 before transmitting over LoRa because they are redundant as the protocol already has a spot for both of them (if you look at the protocol documentation linked below). I also correctly calculated the length of the message now. It should be a total of 240 bytes (2 byte ID + 238 bytes data)
No, it is not, that is the LoRaLayer2 packet structure, the message structure we are discussing here is for the "application layer" and is contained entirely within the The sequence number there is not equivalent to the msg_ID. The sequence number is meant solely to calculate the metric and is global only to LoRaLayer2, not necessarily to the node, see the paragraph about "packet success" here https://github.com/sudomesh/disaster-radio/wiki/Protocol#metrics The purpose of the msg_ID is (obviously) to identify messages, so if a broken or rouge client keeps transmitting the same message it could be identified and prevented from flooding the node with duplicate message. I'm not sure of the best way of handling the msg_ID in the refactored code. Where would one properly store a global variable that needs to be accessed and updated by all the clients? |
discussion related to the bluetooth development reminded me not to forget about routed packets. Whatsmore, both of these issues got me thinking that I need to document how data encapsulation is done on disaster radio. I wrote up some ideas related to that on the wiki, https://github.com/sudomesh/disaster-radio/wiki/Layered-Model Some of the major divergences in this documentation from the current protocol include,
|
@paidforby void LoRaClient::loop()
{
LL2.daemon();
struct Packet packet = LL2.popFromWSBuffer();
size_t len = packet.totalLength - HEADER_LENGTH;
if (packet.totalLength > HEADER_LENGTH && len > 0)
{
uint16_t msg_id;
char msg[len - 2 + 1] = {'\0'};
if (packet.type != 0x72) {
// parse out message and message id
memcpy(&msg_id, packet.data, 2);
memcpy(msg, packet.data + 2, len - 2);
server->transmit(this, msg, len - 2);
}
else
{
// routing package, has no id
memcpy(msg, packet.data, len);
server->transmit(this, msg, len);
}
}
} But on the Websocket client it still didn't work. So I debugged the Javascript and found that the code expected the routing table to be formatted the same way as a chat message but with an 'r' instead of a 'c'. void WebSocketClient::receive(char *message, size_t len)
{
// TODO: msg id? defaulting to 0 for now
uint16_t msg_id = 0;
unsigned char buf[2 + len + 2] = {'\0'};
memcpy(buf, &msg_id, 2);
if (message[1] != '|')
{
buf[2] = 'r';
buf[3] = '|';
memcpy(&buf[4], message, len);
client->binary(buf, len+4);
}
else
{
memcpy(&buf[2], message, len);
client->binary(buf, len+2);
}
} The workarounds are a little bit flacky, but just to let you know where the problems were created. |
Thanks for that input, I have a merge in the works that will address most of this. I was gonna use a struct of unsigned chars (uint8_t), which is not too different from your char* solution. Getting the routing table to the web app was always a bit hacky. We need to come up with a "better", that is, more consistent way for clients to access the routing table. |
@beegee-tokyo and @tlrobinson I've made some progress toward resolving this issue. I'm working on it in the 1.0.0-rc.1 branch. The updates I've made break backwards compatibility hence the major version increment. This is mostly due to changes in LoRaLayer2, specifically that the nextHop address has been moved into the LL2 packet header and now serves the dual purpose of either routing to a neighbor or telling LL2 that a packet contains routing table information. I've more or less implemented what I outlined in my above comment and the Layered Model wiki post. I've tentatively settled on the following struct for datagrams,
All client receive functions are now required to be,
While the server's transmit function takes the following,
While it is possible to write a client that generates an entire datagram, right now I've maintained legacy support for the web app's confusing So far, I only have the LoRaClient and WebSocketClient working. The rest of the clients are in various states of completion. Bluetooth and OLED are very close to working, while History and Console still have a lot of work to be done. I'd appreciate any help or input y'all might have on this. |
@paidforby |
Here are the current issues listed by decreasing priority,
Feel free to work on any of these. I have very little time to work on this in the next week. |
#56 fixes BleUartClient, OLEDClient and Console. Smaller changes in main.ino as well (wrong message sizes in WelcomeMessage) Will look into History tomorrow. TCPClient is a problem, because I have no idea how to test as well. |
Added a lot more changes to #56 . Lot's of changes. You can cherry-pick whatever you like. Updated
New
|
TelnetClient (TCPCLient) is working as well. I made a workaround, just for testing, and it worked well under both Console and TelnetClient. RoutingTableEntry *getRoutingTable(int index); In LoRaLayer2.cpp RoutingTableEntry *LL2Class::getRoutingTableEntry(int index)
{
return &_routeTable[index];
} And in Console.cpp ...
else if ((strncmp(&args[0][1], "lora", 4) == 0))
{
RoutingTableEntry *tblPtr;
int routeTableSize = LL2.getRouteEntry();
Serial.printf("RoutingTableSize = %d", routeTableSize);
Datagram loraInfo;
memset((char *)loraInfo.message, 0, 233);
memcpy(loraInfo.destination, LL2.broadcastAddr(), ADDR_LENGTH);
loraInfo.type = 'c';
size_t msgLen = 0;
strcat((char *)loraInfo.message, "Address: ");
msgLen += 9;
uint8_t *localAdr = LL2.localAddress();
char tempBuf[64];
for (int i = 0; i < ADDR_LENGTH; i++)
{
msgLen += sprintf(tempBuf, "%02X", localAdr[i]);
strcat((char *)loraInfo.message, tempBuf);
}
strcat((char *)loraInfo.message, "\n");
msgLen++;
msgLen += sprintf(tempBuf, "Routing Table: total routes %d\r\n", LL2.getRouteEntry());
strcat((char *)loraInfo.message, tempBuf);
for (int i = 0; i < routeTableSize; i++)
{
tblPtr = LL2.getRoutingTableEntry(i);
msgLen += sprintf(tempBuf, "%d hops from ", tblPtr->distance);
strcat((char *)loraInfo.message, tempBuf);
for (int j = 0; j < ADDR_LENGTH; j++)
{
msgLen += sprintf(tempBuf, "%02X", tblPtr->destination[j]);
strcat((char *)loraInfo.message, tempBuf);
}
strcat((char *)loraInfo.message, " via ");
msgLen += 5;
for (int j = 0; j < ADDR_LENGTH; j++)
{
msgLen += sprintf(tempBuf, "%02X", tblPtr->nextHop[j]);
strcat((char *)loraInfo.message, tempBuf);
}
msgLen += sprintf(tempBuf, " metric %3d ", tblPtr->metric);
strcat((char *)loraInfo.message, tempBuf);
strcat((char *)loraInfo.message, "\n");
msgLen++;
}
client->receive(loraInfo, msgLen + DATAGRAM_HEADER);
// TODO: print to client instead of serial
// Serial.printf("Address: ");
// LL2.printAddress(LL2.localAddress());
// LL2.printRoutingTable();
}
... |
@beegee-tokyo Thanks for all of this, lots of great additions. I finally got sometime to look through most of it. As I mentioned in the PR, the OLED wasn't working for me. It appears that the datagram message was not being converted to a string successfully? I fixed this issue in recent commit to I am also seeing issues with the Serial Console and I have yet to test the Telnet client. I will continue debugging and post updates here. |
Hey all, I recently found https://www.pjon.org/ and it looks like they've come up with a pretty brilliant, super efficient binary format that they use for all mediums: UDP over the internet, data over serial, data over LoRa, etc.
...
Perhaps I am misunderstanding and you are talking about app-specific data structures that are independent of the underlying medium, but I just discovered PJON and am excited about the possibilities of being able to standardize communication using an open protocol and efficient data format suitable for any kind of network, no matter how decentralized or heterogeneous. |
|
Thanks for pointing this out! Looks like a really awesome project and similar to our routing library, https://github.com/sudomesh/LoRaLayer2/. However, it looks like the microcontroller we are using, the Espressif ESP32, is not listed as supported for PJON Through LoRa. Who knows if they've just haven't tested it or if there is an actual hardware limitation (memory is pretty limited on the ESP32). And you are somewhat misunderstanding this thread. In this thread we are talking about the structure of messages sent between "clients" that are internal to the firmware (more Layer3/4 than Layer2). Not sure if PJON could be used to do this also, but the system we have now is pretty good once we clean up this String-struct conversion issue. You maybe interested in another discussion #57 where folks are talking about possible alternatives to LoRaLayer2. It's not clear to me how PJON would handle mesh routing. I'll keep my eye on PJON and see if they get support for the ESP32, then I would definitely consider it as an alternative Layer 2/Layer 3 protocol. |
Looking at their github, https://github.com/gioblu/PJON, I see ESP32 listed as supported. But maybe that's just for general PJON and not PJON Through LoRa? Not sure, more research is needed. |
@paidforby Compatibility table for ESP32: https://github.com/gioblu/PJON/wiki#compatibility . Examples here but no LoRa: https://github.com/gioblu/PJON/tree/master/examples/ESP32/ |
Yup, I just found that also. Looks like Through LoRa is not supported for the ESP32, wonder why? |
Hmm you're right. DualUDP and GlobalUDP are supported! Which is so cool, but... unless the ESP32 uses a gateway (ESP32 =(UDP)=> gateway =(whatever)=> LoRa), looks like this won't work right now. Bummer! |
Good question; I just asked about this to find out more: gioblu/PJON#342 . |
Back to the task at hand. In the recent commit e227e86, I heavily refactored the Console middleware, which appeared to be broken to me. I reintroduced the banner from the pre-client/server console and added a "prompt" showing the current username which mimics the output of the WebSocket and OLED display, though I'm interested in having a more bash-like Next, I'm going to take a look at the TCP client and refactor where necessary. After that, I need to look at the History, which does not appear to be working, but I can't say I've completely tested it. |
So I think I finally have almost everything working. Telnet required a little refactoring, but mostly I just had to make some improvements to the Console middleware and make up for inconsistencies between the Stream client and the Telnet client. @beegee-tokyo I resolved the History appears to be working, but I'm having trouble getting a really good test. There are some changes/improvements I want to make to the History feature, but they are mostly unrelated to this issue. I'm going to close this issue, since I believe it has served its purpose and all the necessary changes have been made on the 1.0.0-rc.1 branch. I should be merging the 1.0.0-rc.1 branch into master soon, at which point this issue will be truly resolved. |
Related to recent PR #44
internal server-client communication currently uses a
String
, this is unsafe and problematic. Instead, it should use astruct
that looks something likewsMessage
from the pre-refactor code,The text was updated successfully, but these errors were encountered: