• bitcoinBitcoin (BTC) $ 112,658.00
  • ethereumEthereum (ETH) $ 4,106.49
  • tetherTether (USDT) $ 1.00
  • bnbBNB (BNB) $ 1,206.14
  • xrpXRP (XRP) $ 2.48
  • solanaSolana (SOL) $ 199.41
  • usd-coinUSDC (USDC) $ 0.999809
  • staked-etherLido Staked Ether (STETH) $ 4,099.28
  • dogecoinDogecoin (DOGE) $ 0.202937
  • tronTRON (TRX) $ 0.315293
  • cardanoCardano (ADA) $ 0.694891
  • wrapped-stethWrapped stETH (WSTETH) $ 4,989.33
  • wrapped-beacon-ethWrapped Beacon ETH (WBETH) $ 4,422.30
  • wrapped-bitcoinWrapped Bitcoin (WBTC) $ 112,660.00
  • chainlinkChainlink (LINK) $ 18.90
  • figure-helocFigure Heloc (FIGR_HELOC) $ 0.995607
  • ethena-usdeEthena USDe (USDE) $ 1.00
  • wrapped-eethWrapped eETH (WEETH) $ 4,426.36
  • stellarStellar (XLM) $ 0.333845
  • hyperliquidHyperliquid (HYPE) $ 39.42
  • bitcoin-cashBitcoin Cash (BCH) $ 534.06
  • suiSui (SUI) $ 2.80
  • avalanche-2Avalanche (AVAX) $ 22.61
  • wethWETH (WETH) $ 4,103.54
  • binance-bridged-usdt-bnb-smart-chainBinance Bridged USDT (BNB Smart Chain) (BSC-USD) $ 1.00
  • leo-tokenLEO Token (LEO) $ 9.63
  • usdsUSDS (USDS) $ 0.999829
  • hedera-hashgraphHedera (HBAR) $ 0.186280
  • coinbase-wrapped-btcCoinbase Wrapped BTC (CBBTC) $ 112,707.00
  • usdt0USDT0 (USDT0) $ 1.00
  • litecoinLitecoin (LTC) $ 97.57
  • mantleMantle (MNT) $ 1.98
  • shiba-inuShiba Inu (SHIB) $ 0.000011
  • whitebitWhiteBIT Coin (WBT) $ 42.75
  • the-open-networkToncoin (TON) $ 2.29
  • ethena-staked-usdeEthena Staked USDe (SUSDE) $ 1.20
  • moneroMonero (XMR) $ 308.98
  • crypto-com-chainCronos (CRO) $ 0.163090
  • polkadotPolkadot (DOT) $ 3.23
  • daiDai (DAI) $ 1.00
  • bittensorBittensor (TAO) $ 459.05
  • uniswapUniswap (UNI) $ 6.75
  • world-liberty-financialWorld Liberty Financial (WLFI) $ 0.144214
  • zcashZcash (ZEC) $ 234.94
  • aaveAave (AAVE) $ 251.50
  • okbOKB (OKB) $ 180.51
  • memecoreMemeCore (M) $ 2.04
  • bitget-tokenBitget Token (BGB) $ 4.84
  • pepePepe (PEPE) $ 0.000007
  • ethenaEthena (ENA) $ 0.432875
  • nearNEAR Protocol (NEAR) $ 2.46
  • aster-2Aster (ASTER) $ 1.45
  • jito-staked-solJito Staked SOL (JITOSOL) $ 246.63
  • blackrock-usd-institutional-digital-liquidity-fundBlackRock USD Institutional Digital Liquidity Fund (BUIDL) $ 1.00
  • usd1-wlfiUSD1 (USD1) $ 1.00
  • susdssUSDS (SUSDS) $ 1.07
  • aptosAptos (APT) $ 3.65
  • ethereum-classicEthereum Classic (ETC) $ 16.75
  • paypal-usdPayPal USD (PYUSD) $ 0.999985
  • c1usdCurrency One USD (C1USD) $ 1.00
  • ondo-financeOndo (ONDO) $ 0.794282
  • binance-peg-wethBinance-Peg WETH (WETH) $ 4,119.46
  • falcon-financeFalcon USD (USDF) $ 0.998542
  • jupiter-perpetuals-liquidity-provider-tokenJupiter Perpetuals Liquidity Provider Token (JLP) $ 5.57
  • story-2Story (IP) $ 6.60
  • worldcoin-wldWorldcoin (WLD) $ 0.955158
  • polygon-ecosystem-tokenPOL (ex-MATIC) (POL) $ 0.200435
  • binance-staked-solBinance Staked SOL (BNSOL) $ 214.11
  • gatechain-tokenGate (GT) $ 16.13
  • internet-computerInternet Computer (ICP) $ 3.55
  • htx-daoHTX DAO (HTX) $ 0.000002
  • kucoin-sharesKuCoin (KCS) $ 14.32
  • arbitrumArbitrum (ARB) $ 0.340753
  • usdtbUSDtb (USDTB) $ 1.00
  • rocket-pool-ethRocket Pool ETH (RETH) $ 4,693.89
  • algorandAlgorand (ALGO) $ 0.203647
  • pi-networkPi Network (PI) $ 0.214550
  • hash-2Provenance Blockchain (HASH) $ 0.035134
  • chainopera-aiChainOpera AI (COAI) $ 8.72
  • bfusdBFUSD (BFUSD) $ 1.00
  • cosmosCosmos Hub (ATOM) $ 3.48
  • vechainVeChain (VET) $ 0.019106
  • wbnbWrapped BNB (WBNB) $ 1,210.77
  • kelp-dao-restaked-ethKelp DAO Restaked ETH (RSETH) $ 4,331.29
  • kaspaKaspa (KAS) $ 0.060460
  • tether-goldTether Gold (XAUT) $ 4,150.47
  • kinetic-staked-hypeKinetiq Staked HYPE (KHYPE) $ 39.44
  • stakewise-v3-osethStakeWise Staked ETH (OSETH) $ 4,328.50
  • pudgy-penguinsPudgy Penguins (PENGU) $ 0.024646
  • liquid-staked-ethereumLiquid Staked ETH (LSETH) $ 4,420.11
  • render-tokenRender (RENDER) $ 2.82
  • skySky (SKY) $ 0.062047
  • flare-networksFlare (FLR) $ 0.018959
  • pump-funPump.fun (PUMP) $ 0.004075
  • sei-networkSei (SEI) $ 0.223829
  • lombard-staked-btcLombard Staked BTC (LBTC) $ 112,914.00
  • renzo-restaked-ethRenzo Restaked ETH (EZETH) $ 4,350.27
  • pax-goldPAX Gold (PAXG) $ 4,155.54
  • official-trumpOfficial Trump (TRUMP) $ 6.28
  • bonkBonk (BONK) $ 0.000016
  • nexoNEXO (NEXO) $ 1.22
  • pancakeswap-tokenPancakeSwap (CAKE) $ 3.47
  • jupiter-exchange-solanaJupiter (JUP) $ 0.372667
  • syrupusdcSyrup USDC (SYRUPUSDC) $ 1.13
  • filecoinFilecoin (FIL) $ 1.67
  • binance-bridged-usdc-bnb-smart-chainBinance Bridged USDC (BNB Smart Chain) (USDC) $ 1.00
  • solv-btcSolv Protocol BTC (SOLVBTC) $ 112,665.00
  • spx6900SPX6900 (SPX) $ 1.21
  • immutable-xImmutable (IMX) $ 0.573473
  • xdce-crowd-saleXDC Network (XDC) $ 0.060358
  • first-digital-usdFirst Digital USD (FDUSD) $ 0.996638
  • mantle-staked-etherMantle Staked Ether (METH) $ 4,419.28
  • morphoMorpho (MORPHO) $ 1.92
  • doublezeroDoubleZero (2Z) $ 0.285933
  • jupiter-staked-solJupiter Staked SOL (JUPSOL) $ 228.41
  • celestiaCelestia (TIA) $ 1.15
  • injective-protocolInjective (INJ) $ 9.54
  • arbitrum-bridged-wbtc-arbitrum-oneArbitrum Bridged WBTC (Arbitrum One) (WBTC) $ 112,924.00
  • solmevSolMev (SN116) $ 2,398.72
  • clbtcclBTC (CLBTC) $ 115,168.00
  • fasttokenFasttoken (FTN) $ 2.02
  • lido-daoLido DAO (LDO) $ 0.958022
  • optimismOptimism (OP) $ 0.479002
  • ripple-usdRipple USD (RLUSD) $ 0.999868
  • blockstackStacks (STX) $ 0.467264
  • msolMarinade Staked SOL (MSOL) $ 265.72
  • curve-dao-tokenCurve DAO (CRV) $ 0.589842
  • fetch-aiArtificial Superintelligence Alliance (FET) $ 0.320274
  • plasmaPlasma (XPL) $ 0.448242
  • aerodrome-financeAerodrome Finance (AERO) $ 0.887554
  • cgeth-hashkey-cloudcgETH Hashkey Cloud (CGETH.HASH) $ 3,932.20
  • ousgOUSG (OUSG) $ 112.92
  • global-dollarGlobal Dollar (USDG) $ 0.999991
  • sonic-3Sonic (S) $ 0.199405
  • the-graphThe Graph (GRT) $ 0.069454
  • l2-standard-bridged-weth-baseL2 Standard Bridged WETH (Base) (WETH) $ 4,104.63
  • flokiFLOKI (FLOKI) $ 0.000074
  • superstate-short-duration-us-government-securities-fund-ustbSuperstate Short Duration U.S. Government Securities Fund (USTB) (USTB) $ 10.85
  • pyth-networkPyth Network (PYTH) $ 0.123410
  • ondo-us-dollar-yieldOndo US Dollar Yield (USDY) $ 1.09
  • usdx-money-usdxStables Labs USDX (USDX) $ 0.997654
  • havvenSynthetix (SNX) $ 1.97
  • saros-financeSaros (SAROS) $ 0.257134
  • kaiaKaia (KAIA) $ 0.112686
  • tbtctBTC (TBTC) $ 112,572.00
  • tezosTezos (XTZ) $ 0.618175
  • arbitrum-bridged-weth-arbitrum-oneArbitrum Bridged WETH (Arbitrum One) (WETH) $ 4,108.49
  • ether-fiEther.fi (ETHFI) $ 1.23
  • aethirAethir (ATH) $ 0.044507
  • stader-ethxStader ETHx (ETHX) $ 4,396.65
  • gtethGTETH (GTETH) $ 4,117.97
  • newton-projectAB (AB) $ 0.007595
  • pendlePendle (PENDLE) $ 3.66
  • iotaIOTA (IOTA) $ 0.150287
  • conflux-tokenConflux (CFX) $ 0.117095
  • myx-financeMYX Finance (MYX) $ 3.17
  • usdaiUSDai (USDAI) $ 1.03
  • beldexBeldex (BDX) $ 0.079325
  • trust-wallet-tokenTrust Wallet (TWT) $ 1.39
  • dogwifcoindogwifhat (WIF) $ 0.575908
  • theta-tokenTheta Network (THETA) $ 0.569928
  • dashDash (DASH) $ 45.29
  • ethereum-name-serviceEthereum Name Service (ENS) $ 16.97
  • galaGALA (GALA) $ 0.011998
  • coinbase-wrapped-staked-ethCoinbase Wrapped Staked ETH (CBETH) $ 4,514.70
  • usual-usdUsual USD (USD0) $ 0.999076
  • the-sandboxThe Sandbox (SAND) $ 0.224627
  • swethSwell Ethereum (SWETH) $ 4,495.21
  • starknetStarknet (STRK) $ 0.126475
  • mantle-restaked-ethMantle Restaked ETH (CMETH) $ 4,411.42
  • raydiumRaydium (RAY) $ 2.00
  • bitcoin-avalanche-bridged-btc-bAvalanche Bridged BTC (Avalanche) (BTC.B) $ 112,573.00
  • virtual-protocolVirtuals Protocol (VIRTUAL) $ 0.815837
  • jasmycoinJasmyCoin (JASMY) $ 0.010860
  • rna-2RNA (SN117) $ 4,708.96
  • binance-peg-dogecoinBinance-Peg Dogecoin (DOGE) $ 0.203629
  • decentralandDecentraland (MANA) $ 0.270875
  • bittorrentBitTorrent (BTT) $ 0.00000052
  • eigenlayerEigenCloud (prev. EigenLayer) (EIGEN) $ 1.34
  • swissborgSwissBorg (BORG) $ 0.519831
  • mantle-bridged-usdt-mantleMantle Bridged USDT (Mantle) (USDT) $ 1.00
  • vaultaVaulta (A) $ 0.313679
  • astherus-staked-bnbAster Staked BNB (ASBNB) $ 1,278.55
  • true-usdTrueUSD (TUSD) $ 1.00
  • arbitrum-bridged-wrapped-eethArbitrum Bridged Wrapped eETH (Arbitrum) (WEETH) $ 4,419.61
  • steakhouse-usdc-morpho-vaultSteakhouse USDC Morpho Vault (STEAKUSDC) $ 1.10
  • usddUSDD (USDD) $ 1.00
  • flowFlow (FLOW) $ 0.295955
  • bridged-usdc-polygon-pos-bridgePolygon Bridged USDC (Polygon PoS) (USDC.E) $ 0.999801
  • syrupMaple Finance (SYRUP) $ 0.427904
  • polygon-pos-bridged-dai-polygon-posPolygon PoS Bridged DAI (Polygon POS) (DAI) $ 0.999822
  • zero-gravity0G (0G) $ 2.20
  • ether-fi-staked-ethether.fi Staked ETH (EETH) $ 4,087.86
  • sun-tokenSun Token (SUN) $ 0.024132
  • ai-companionsAI Companions (AIC) $ 0.458210
  • jito-governance-tokenJito (JTO) $ 1.16
  • bitcoin-svBitcoin SV (BSV) $ 22.57
  • frax-etherFrax Ether (FRXETH) $ 4,068.24
  • polygon-pos-bridged-weth-polygon-posPolygon PoS Bridged WETH (Polygon POS) (WETH) $ 4,102.71
  • crvusdcrvUSD (CRVUSD) $ 1.00

Meta Unveils Open Source Llama 3.2: AI That Sees And Fits in Your Pocket

0 68

Meta Unveils Open Source Llama 3.2: AI That Sees And Fits in Your Pocket

It’s been a good week for open-source AI.

On Wednesday, Meta announced an upgrade to its state-of-the-art large language model, Llama 3.2, and it doesn’t just talk—it sees.

More intriguing, some versions can squeeze into your smartphone without losing quality, which means you could potentially have private local AI interactions, apps and customizations without sending your data to third party servers.

Unveiled Wednesday during Meta Connect, Llama 3.2 comes in four flavors, each packing a different punch. The heavyweight contenders—11B and 90B parameter models—flex their muscles with both text and image processing capabilities.

They can tackle complex tasks such as analyzing charts, captioning images, and even pinpointing objects in pictures based on natural language descriptions.

Llama 3.2 arrived the same week as Allen Institute’s Molmo, which claimed to be the best open-source multimodal vision LLM in synthetic benchmarks, performing in our tests on par with GPT-4o, Claude 3.5 Sonnet, and Reka Core.

Zuck’s company also introduced two new flyweight champions: a pair of 1B and 3B parameter models designed for efficiency, speed, and limited but repetitive tasks that don’t require too much computation.

These small models are multilingual text maestros with a knack for “tool-calling,” meaning they can integrate better with programming tools. Despite their diminutive size, they boast an impressive 128K token context window—the same as GPT4o and other powerful models—making them ideal for on-device summarization, instruction following, and rewriting tasks.

Meta’s engineering team pulled off some serious digital gymnastics to make this happen. First, they used structured pruning to trim the unnecessary data from larger models, then employed knowledge distillation—transferring knowledge from large models to smaller ones—to squeeze in extra smarts.

The result was a set of compact models that outperformed rival competitors in their weight class, besting models including Google’s Gemma 2 2.6B and Microsoft’s Phi-2 2.7B on various benchmarks.

Meta Unveils Open Source Llama 3.2: AI That Sees And Fits in Your Pocket

Meta is also working hard to boost on-device AI. They’ve forged alliances with hardware titans Qualcomm, MediaTek, and Arm to ensure Llama 3.2 plays nice with mobile chips from day one. Cloud computing giants aren’t left out either—AWS, Google Cloud, Microsoft Azure, and a host of others are offering instant access to the new models on their platforms.

Under the hood, Llama 3.2’s vision capabilities come from clever architectural tweaking. Meta’s engineers baked in adapter weights onto the existing language model, creating a bridge between pre-trained image encoders and the text-processing core.

In other words, the model’s vision capabilities don’t come at the expense of its text processing competence, so users can expect similar or better text results when compared to Llama 3.1.

The Llama 3.2 release is Open Source—at least by Meta’s standards. Meta is making the models available for download on Llama.com and Hugging Face, as well as through their extensive partner ecosystem.

Those interested in running it on the cloud can use their own Google Collab Notebook or use Groq for text-based interactions, generating nearly 5000 tokens in less than 3 seconds.

Meta Unveils Open Source Llama 3.2: AI That Sees And Fits in Your Pocket

Riding the Llama

We put Llama 3.2 through its paces, quickly testing its capabilities across various tasks.

In text-based interactions, the model performs on par with its predecessors. However, its coding abilities yielded mixed results.

When tested on Groq’s platform, Llama 3.2 successfully generated code for popular games and simple programs. Yet, the smaller 70B model stumbled when asked to create functional code for a custom game we devised. The more powerful 90B, however, was a lot more efficient and generated a functional game on the first try.

Meta Unveils Open Source Llama 3.2: AI That Sees And Fits in Your Pocket

You can see the full code generated by Llama-3.2 and all the other models we tested by clicking on this link.

Identifying styles and subjective elements in images

Meta Unveils Open Source Llama 3.2: AI That Sees And Fits in Your Pocket

Llama 3.2 excels at identifying subjective elements in images. When presented with a futuristic, cyberpunk-style image and asked if it fit the steampunk aesthetic, the model accurately identified the style and its elements. It provided a satisfactory explanation, noting that the image didn’t align with steampunk due to the absence of key elements associated with that genre.

Chart Analysis (and SD image recognition)

Meta Unveils Open Source Llama 3.2: AI That Sees And Fits in Your Pocket

Chart analysis is another strong suit for Llama 3.2, though it does require high-resolution images for optimal performance. When we input a screenshot containing a chart—one that other models like Molmo or Reka could interpret—Llama’s vision capabilities faltered. The model apologized, explaining that it couldn’t read the letters properly due to the image quality.

Text in Image Identification

Meta Unveils Open Source Llama 3.2: AI That Sees And Fits in Your Pocket

While Llama 3.2 struggled with small text in our chart, it performed flawlessly when reading text in larger images. We showed it a presentation slide introducing a person, and the model successfully understood the context, distinguishing between the name and job role without any errors.

Verdict

Overall, Llama 3.2 is a big improvement over its previous generation and is a great addition to the open-source AI industry. Its strengths are in image interpretation and large-text recognition, with some areas for potential improvement, particularly in processing lower-quality images and tackling complex, custom coding tasks.

The promise of on-device compatibility is also good for the future of private and local AI tasks and is a great counterweight to close offers like Gemini Nano and Apple’s proprietary models.

Edited by Josh Quittner and Sebastian Sinclair

Source

Leave A Reply

Your email address will not be published.