• brucethemoose@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    arrow-down
    2
    ·
    14 days ago

    It’s not open source, but open weights, documented, relatively permissively licensed and all the inference/finetuning libraries for it are open source.

    • MCasq_qsaCJ_234@lemmy.zip
      link
      fedilink
      English
      arrow-up
      2
      ·
      14 days ago

      I understand, but Meta has the rights to Llama and at any time they can change that license to make it less open just to make more money.

      Currently it is open weight to attract customers, because once there are no competitors they will start to squeeze them.

      • brucethemoose@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        14 days ago

        Also competition is stiff. Alibaba is currently handing their butts to them with Qwen 2.5. Deepseek (a Chinese startup), tencent and Mistral (French) are giving them a run for their money too, and there are even some that “continue train” their old weights.

          • brucethemoose@lemmy.world
            link
            fedilink
            English
            arrow-up
            1
            ·
            14 days ago

            A small startup called Arcee AI actually “distilled” logits from several other models (Llama, Mistral) and used the data to continue train Qwen 2.5 14B (which itself is Apache 2.0). It’s called supernova medius, and it’s quite incredible for a 14B model… SOTA as far as I know, even with their meager GPU resources.

            A company called upstage “expands” models to larger parameter counts by continue training them. Look up the SOLAR series.

            And quite notably, Nvidia continue trained Llama 3.1 70B and published the weights as Nemotron 70B. It was the best 70B model for awhile, and may still be in some areas.

            And some companies like Cohere continuously train the same model slowly, and offer it over API, but occasionally publish the weights to promote them.

            • MCasq_qsaCJ_234@lemmy.zip
              link
              fedilink
              English
              arrow-up
              1
              ·
              14 days ago

              The fact that there is AI with open source licenses is already a good thing, as is the competition. Although in my opinion it is not enough because it can further consolidate oligopolies in this sector.

              Trying to prevent OpenAI from becoming a for-profit seems to me to be a questionable tactic. It’s as if Mozilla wanted to be a for-profit company in order to make Firefox more competitive with Chrome, but Google opposes this measure.

              • brucethemoose@lemmy.world
                link
                fedilink
                English
                arrow-up
                1
                ·
                14 days ago

                Well for one, I directly disagree with Altman’s fundamental proposition, they don’t need to “scale” AI so dramatically to make it better.

                See: Qwen 2.5 from Alibaba, a fraction of the size, made with a tiny fraction of the H100 GPUs and highly competitive (and (mostly) Apache licensed). And frankly, OpenAI is pointedly ignoring all sorts of open research that could make their models notably better or more powerful efficient, even with the vast resources and prestige they have… they seem most interested in anticompetitive efforts to regulate competitors that would make them look bad, using the spectre of actual AGI (which has nothing to do with transformers LLMs) to scare people.

                Even if doing it for the wrong reasons, I feel like Google would be right to oppose Mozilla axing the nonprofit division if they were somehow in a similar position to OpenAI. Their mission of producing a better, safer browser would basically be lying through their teeth.

                • MCasq_qsaCJ_234@lemmy.zip
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  14 days ago

                  Open AI has different priorities they want to achieve AGI, so they seek to explore the capabilities of AI not look at what competency does in those directions to replicate and/or improve it. They only optimize it to make their services faster and less resource consuming.

                  Also, becoming a for-profit organization doesn’t mean you eliminate your non-profit division. Those two parts separate and become independent, although the nonprofit ends up getting considerable funds from the funding offer received by the other part.

                  As is the case with Mastercard, whose nonprofit organization is one of the richest in the world. In that scenario Mozilla would split into two entities one would focus on making a profit and making Firefox more competitive, while the other would focus on what Mozilla currently does.

      • brucethemoose@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        14 days ago

        No, they can’t, because you can just pull the git repo with the old license as use them as they were at the time of upload, just like any software on a git repository. And too many people have them downloaded to delete them from the internet.

        There are also finetunes inheriting the old license, and those orga are not going to pull the weights.

        • MCasq_qsaCJ_234@lemmy.zip
          link
          fedilink
          English
          arrow-up
          2
          ·
          14 days ago

          And in that case, will the Llama fork be the same as the Meta fork? We are talking about AI that has a considerable development, companies would probably not participate because it is not an open source license and its clause limits in those aspects.

          Also you have to think that if the new version of Llama with the new license is 3 times better than Llama with the previous license, do you really think that the community will continue to develop the previous version?

          • brucethemoose@lemmy.world
            link
            fedilink
            English
            arrow-up
            1
            ·
            edit-2
            14 days ago

            And in that case, will the Llama fork be the same as the Meta fork? We are talking about AI that has a considerable development, companies would probably not participate because it is not an open source license and its clause limits in those aspects.

            Llama has tons of commercial use even with its “non open” license, which is basically just a middle finger to companies the size of Google or Microsoft. And yes, companies would keep using the old weights like nothing changed… because nothing did. Just like they keep using open source software that goes through drama.

            Also you have to think that if the new version of Llama with the new license is 3 times better than Llama with the previous license, do you really think that the community will continue to develop the previous version?

            Honestly I have zero short term worries about this because the space is so fiercely competitive. If Llama 3 was completely closed, the open ecosystem would have been totally fine without Meta.

            Also much of the ecosystem (like huggingface and inference libraries) is open source and out of their control.

            And if they go API only, they will just get clobbered by Google, Claude, Deepseek or whomever.

            In the longer term… these transformers style models will be obsolete anyway. Honestly I have no idea what the landscape will look like.

            • MCasq_qsaCJ_234@lemmy.zip
              link
              fedilink
              English
              arrow-up
              1
              ·
              14 days ago

              Well, I agree that we don’t know what the situation will look like over time.

              There may be a limit that will cause another AI winter, driving companies away for a while because they invested money and received little.

              Transformers may remain relevant or end up being obsolete, although there are still many papers related to AI in one way or another.

              • brucethemoose@lemmy.world
                link
                fedilink
                English
                arrow-up
                1
                ·
                14 days ago

                The limit is already (apparently) starting to be data… and capital, lol.

                There could be a big computer breakthrough like , say, fast bitnet training that makes the multimodal approach much easier to train though.

                • MCasq_qsaCJ_234@lemmy.zip
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  14 days ago

                  I think this method is not convincing for companies, because they prefer more power and to do it on their own, because they don’t want their ideas to be replicated.