html_url,issue_url,id,node_id,user,created_at,updated_at,author_association,body,reactions,issue,performed_via_github_app https://github.com/simonw/datasette/issues/506#issuecomment-500903287,https://api.github.com/repos/simonw/datasette/issues/506,500903287,MDEyOklzc3VlQ29tbWVudDUwMDkwMzI4Nw==,9599,2019-06-11T15:48:27Z,2019-06-11T15:48:27Z,OWNER,"Calling out to Tika does make me a little nervous, but that's why Datasette has plugins! A plugin that calls Tika (and caches the results) could be really interesting.","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",453846217, https://github.com/simonw/datasette/issues/506#issuecomment-500238035,https://api.github.com/repos/simonw/datasette/issues/506,500238035,MDEyOklzc3VlQ29tbWVudDUwMDIzODAzNQ==,1059677,2019-06-09T19:21:18Z,2019-06-09T19:21:18Z,NONE,"If you don't mind calling out to Java, then Apache Tika is able to tell you what a load of ""binary stuff"" is, plus render it to XHTML where possible. There's a python wrapper around the Apache Tika server, but for a more typical datasette usecase you'd probably just want to grab the Tika CLI jar, and call it with `--detect` and/or `--xhtml` to process the unknown binary blob","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",453846217, https://github.com/simonw/datasette/issues/506#issuecomment-500224864,https://api.github.com/repos/simonw/datasette/issues/506,500224864,MDEyOklzc3VlQ29tbWVudDUwMDIyNDg2NA==,9599,2019-06-09T16:07:39Z,2019-06-09T16:07:39Z,OWNER,"Shipped 0.1 of the plugin! I'm pretty happy with this display format: ","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",453846217, https://github.com/simonw/datasette/issues/506#issuecomment-500220862,https://api.github.com/repos/simonw/datasette/issues/506,500220862,MDEyOklzc3VlQ29tbWVudDUwMDIyMDg2Mg==,9599,2019-06-09T15:25:18Z,2019-06-09T15:26:23Z,OWNER,I'm going to call this `datasette-render-binary`: https://github.com/simonw/datasette-render-binary,"{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",453846217, https://github.com/simonw/datasette/issues/506#issuecomment-500220646,https://api.github.com/repos/simonw/datasette/issues/506,500220646,MDEyOklzc3VlQ29tbWVudDUwMDIyMDY0Ng==,9599,2019-06-09T15:22:12Z,2019-06-09T15:22:12Z,OWNER,"![3C9CCDBA-F346-47CB-BFEC-964B0426E728](https://user-images.githubusercontent.com/9599/59160835-789ca900-8a8f-11e9-9767-0f50890d17fe.jpeg) New idea: show essentially this but differentiate the escape sequences in some way. Maybe wrap them in `` or put the non-escape sequences in bold? ","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",453846217, https://github.com/simonw/datasette/issues/506#issuecomment-500183948,https://api.github.com/repos/simonw/datasette/issues/506,500183948,MDEyOklzc3VlQ29tbWVudDUwMDE4Mzk0OA==,9599,2019-06-09T04:22:58Z,2019-06-09T04:26:53Z,OWNER,"This is quite nice: ``` $ od -c /tmp/Thumb64Segment_11.data | head -n 10 0000000 \0 \0 @ \0 \0 \0 005 5 X T S F \0 \0 \0 001 0000020 \0 \0 \0 \0 \0 \0 \0 \0 \0 \0 \0 \0 \0 \0 \0 \0 * 0010000 025 030 . 377 026 032 . 377 027 033 - 377 031 035 0 377 0010020 032 036 1 377 036 5 377 037 ! 8 377 036 "" : 377 0010040 036 8 377 $ : 377 ! & ; 377 $ * ? 377 0010060 ' - ? 377 % * < 377 % , > 377 - 3 E 377 0010100 6 ; M 377 : @ O 377 = C R 377 @ G V 377 0010120 @ I X 377 < B Q 377 8 @ N 377 8 @ P 377 0010140 : C T 377 ; C U 377 : C V 377 9 C W 377 ``` Here's a rough Python equivalent http://code.activestate.com/recipes/579120-data_dumppy-like-the-unix-od-octal-dump-command/","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",453846217, https://github.com/simonw/datasette/issues/506#issuecomment-500183106,https://api.github.com/repos/simonw/datasette/issues/506,500183106,MDEyOklzc3VlQ29tbWVudDUwMDE4MzEwNg==,9599,2019-06-09T03:59:12Z,2019-06-09T03:59:12Z,OWNER,Another cheap trick is the equivalent of the Unix `strings` command - https://stackoverflow.com/questions/6804582/extract-strings-from-a-binary-file-in-python,"{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",453846217, https://github.com/simonw/datasette/issues/506#issuecomment-500183002,https://api.github.com/repos/simonw/datasette/issues/506,500183002,MDEyOklzc3VlQ29tbWVudDUwMDE4MzAwMg==,9599,2019-06-09T03:56:06Z,2019-06-09T03:56:06Z,OWNER,"What are some other interesting tricks we can use to make binary data a bit more interesting to look at? https://martin.varela.fi/2017/09/09/simple-binary-data-visualization/ has some really clever visualization tricks - probably a bit much for this plugin though. See also https://codisec.com/binary-visualization-explained/ https://github.com/tryexceptpass/perceptio is some much simpler code for rendering an image for a binary. ","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",453846217, https://github.com/simonw/datasette/issues/506#issuecomment-500173881,https://api.github.com/repos/simonw/datasette/issues/506,500173881,MDEyOklzc3VlQ29tbWVudDUwMDE3Mzg4MQ==,9599,2019-06-08T23:55:52Z,2019-06-08T23:55:52Z,OWNER,"Possible plugin direction: ```python import filetype # https://pypi.org/project/filetype/ @hookimpl(trylast=True) def render_cell(value): if isinstance(value, bytes): info = repr(value) # May still want to truncate this on table view (but not on row page) guess = filetype.guess(value) if guess is not None: # Need jinja2 markup here for \n to display info = ""Guess: mime={}, extension={}\n\n{}"".format( guess.mime, guess.extension, info ) return info return None ```","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",453846217, https://github.com/simonw/datasette/issues/506#issuecomment-500173464,https://api.github.com/repos/simonw/datasette/issues/506,500173464,MDEyOklzc3VlQ29tbWVudDUwMDE3MzQ2NA==,9599,2019-06-08T23:44:59Z,2019-06-08T23:44:59Z,OWNER,This could also be handled by a plugin.,"{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",453846217,