I want to extract and process the metadata from PNG images and the first line of .safetensors files for LLM’s and LoRA’s. I could spend ages farting around with sed or awk but formats of files are constantly changing. I’d like a faster way to see a summary of training and a few other details when they are available.

    • huginn@feddit.it
      link
      fedilink
      arrow-up
      10
      ·
      1 year ago

      I have a very handy command in my .vimrc for this -

      command! JSON setlocal filetype=json | %!jq .

      Anytime I’m in a json file that isn’t formatted it’s as simple as typing :JSON to have it all sorted.

    • pingveno@lemmy.ml
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      Yeah, I’ve been learning some nushell. If you’re dealing with data, it’s just a great tool. So many sharp edges in the POSIX shell come from it being stringly typed, so having a strongly typed shell is extremely helpful.

  • Nibodhika@lemmy.world
    link
    fedilink
    arrow-up
    5
    ·
    1 year ago

    A week ago I would have said jq, but just the other day I discovered nushell and have been loving it, if you deal with structured data often it’s way easier, just bear in mind it’s not POSIX compatible

  • Hammerheart@programming.dev
    link
    fedilink
    arrow-up
    4
    ·
    1 year ago

    What are some goos resources for learning jq? I really struggle when it comes to nested keys/values which obviously limits my ability to use it.

      • Hammerheart@programming.dev
        link
        fedilink
        arrow-up
        2
        arrow-down
        1
        ·
        edit-2
        1 year ago

        I have perused it, but its both so dense and so broad that its not that helpful unless i know exactly what I’m looking for. I have also tried info and tldr. I actually like tldr the most,. although the exhaustiveness of the man pages must be admired. I dont find it to be the best teacher.

  • Diplomjodler@lemmy.world
    link
    fedilink
    arrow-up
    3
    ·
    1 year ago

    Python is very good for working with JSON. Definitely will get you there faster than awk for anything not completely trivial.

    • 𞋴𝛂𝛋𝛆@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      2
      ·
      1 year ago

      I found a Python project that does enough for my needs. Jq looks super powerful though. Thanks. I managed to get yq working for PNG’s, but I had trouble with both jq and yq with safetensor files. I couldn’t figure out how to parse a string embedded in an inconsistent starting binary, and with massive files. I could get in and grab the first line with head. I tried some stuff with expansions, but that didn’t work and sent me looking for others that have solved the issue better than myself.

  • CaptPretentious@lemmy.world
    link
    fedilink
    arrow-up
    2
    arrow-down
    4
    ·
    1 year ago

    Probably not popular opinion, but pwsh (powershell). It’s got a lot of tooling built in and means I don’t have to learn a different tool just because I’m in a different system.