misk@sopuli.xyz to Technology@lemmy.worldEnglish · edit-223 hours ago'Meta Torrented over 81 TB of Data Through Anna's Archive, Despite Few Seeders'torrentfreak.comexternal-linkmessage-square73fedilinkarrow-up1589arrow-down12cross-posted to: [email protected][email protected]
arrow-up1587arrow-down1external-link'Meta Torrented over 81 TB of Data Through Anna's Archive, Despite Few Seeders'torrentfreak.commisk@sopuli.xyz to Technology@lemmy.worldEnglish · edit-223 hours agomessage-square73fedilinkcross-posted to: [email protected][email protected]
minus-squareFooBarrington@lemmy.worldlinkfedilinkEnglisharrow-up2arrow-down2·5 hours agoI support FOSS LLMs, but which actually exist? Which LLMs have open-sourced all their training data?
minus-squareLainTrain@lemmy.dbzer0.comlinkfedilinkEnglisharrow-up1·edit-22 hours agoMistral? Deepseek? Not LLM but also SD which uses a very popular free dataset.
minus-squareFooBarrington@lemmy.worldlinkfedilinkEnglisharrow-up1·2 hours agoCan I freely download all the training data for any of those? I was under the impression they were all trained on non-licensed and copyrighted data.
I support FOSS LLMs, but which actually exist? Which LLMs have open-sourced all their training data?
Mistral? Deepseek?
Not LLM but also SD which uses a very popular free dataset.
Can I freely download all the training data for any of those? I was under the impression they were all trained on non-licensed and copyrighted data.