• BetaDoggo_@lemmy.world
    English
    arrow-up
    3
    ·
    8 months ago

    Its size makes it basically useless. It underperforms models even in its active weight class. It’s nice that it’s available, but Grok-0 would have been far more interesting.

  • CyberSeeker@discuss.tchncs.de
    English
    arrow-up
    1
    ·
    8 months ago

    Who cares if the code is open source, or pre-training weights are released? Virtually every Master’s in CS student in 2024 is building this from scratch. The differentiator is the training dataset, or at worst, the weights after fine-tuning the model.