homedark

Zig's std.json.parseFormSlice and std.json.Parsed(T)

Jun 18, 2024

In Zig's discord server, I see a steady stream of developers new to Zig struggling with parsing JSON. I like helping with this problem because you can learn a lot about Zig through it. A typical, but incorrect, first attempt looks something like:

const std = @import("std");
const Allocator = std.mem.Allocator;

const Config = struct {
  db_path: []const u8,
};

pub fn main() !void {
  var gpa = std.heap.GeneralPurposeAllocator(.{}){};
  defer _ = gpa.deinit();
  const allocator = gpa.allocator();

  const config = try parseConfig(allocator, "config.json");

  std.debug.print("db path: {s}\n", .{config.db_path});
}

fn parseConfig(allocator: Allocator, path: []const u8) !Config {
  const data = try std.fs.cwd().readFileAlloc(allocator, path, 4096);
  defer allocator.free(data);

  const parsed = try std.json.parseFromSlice(Config, allocator, data, .{});
  return parsed.value;
}

This code has two bugs: a dangling pointer and memory leak. Because the use of a dangling pointer is an undefined behavior, this code may or may not crash on the last line of main (which tries to print db_path), but it definitely will report a memory leak.

Dangling Pointer

First we'll fix the dangling pointer. The parseFromSlice function takes our JSON input (data) and tries to parse it into a Config. Our JSON input comes from reading the file. Thanks to the call to defer allocator.free(data), this data is freed when parseConfig exits. This is the source of our bug. By default, parseFromSlice uses references to the underlying JSON input. So when data is freed, those references are no longer valid.

The last parameter to parseFromSlice is a ParseOption. It controls various aspects of parsing. The option we care about is allocate which defaults to .alloc_if_needed. We need to pass .{.allocate = .alloc_always} to fix our dangling pointer:

... = try std.json.parseFromSlice(Config, allocator, data, .{
      .allocate = .alloc_always,
  });

We could consider this fixed, and move on to our memory leak, but let's go a bit deeper. The ability to reference the input data is an optimization. When the input outlives the parsed value, it makes sense to simply reference the existing input. To see this in action, we could change our code so that data outlives our parsed value and not used the alloc_always setting:

pub fn main() !void {
  var gpa = std.heap.GeneralPurposeAllocator(.{}){};
  defer _ = gpa.deinit();
  const allocator = gpa.allocator();

  const data = try std.fs.cwd().readFileAlloc(allocator, "config.json", 4096);
  defer allocator.free(data);

  const parsed = try std.json.parseFromSlice(Config, allocator, data, .{});
  std.debug.print("db path: {s}\n", .{parsed.value.db_path});
}

This version doesn't have a dangling pointer and doesn't have to copy the string values of our JSON input to populate the fields of Config. Because data outlives parsed, references from parsed.value to data remain valid (although, someone could mutate data, but that's another problem). This version still has the same memory leak though (we'll get to that soon).

The allocate option has two possible values. The default is alloc_if_needed and the other, the one we used to fix the code, is alloc_always. You could be forgiven for thinking that our code should have worked with the default, alloc_if_needed. After all, our code needed the string value duplicated since our parsed Config outlives the input, right? But the "if needed" part of alloc_if_needed references the parser itself: allocate if the JSON parser needs it. If you think about it, this makes sense. There's no way for parseFromSlice to know that the parsed value outlives the JSON input. I think this option would be less confusing as a boolean called dupe_strings.

When does the parser itself need to make allocations? There are two cases. The first is for internal bookkeeping. Specifically, since JSON can be arbitrarily nested, and nesting can be a mix of arrays and objects, a parser needed to track of the type of value (object or array) of each nesting. Secondly, parseFromSlice is a higher-level API over a low-level JSON scanner. That scanner is more generic and works over a stream of data. If the JSON input is being streamed in, one chunk at a time, a string value might span multiple chunks. In such cases, the string parts must be duped by the scanner in order to produce a single cohesive string value. Because of these two cases, there's no none option to the allocate. An specialized scanner with a max supported nesting, that only worked on a long-lived full JSON input could be implemented without allocations. But that isn't how Zig's scanner works.

Memory Leak

For the second issue, the memory leak, recall the original code called parseFromSlice and assigned the return value to a variable name result and then returned result.value. That's immediately suspicious. If nothing else, what exactly does parseFromSlice return, and why doesn't it return a T (Config in our example) directly? From Zig's documentation, we know that the return type is a std.json.Parsed(T), but that type isn't documented.

Parsed(T) is a simple type. It's obviously a generic and has two fields:

pub fn Parsed(comptime T: type) type {
  return struct {
    value: T,
    arena: *ArenaAllocator,

    pub fn deinit(self: @This()) void {
      //...
    }
  };
}

In the previous section we saw that parsing JSON almost always requires allocations, for internal bookkeeping and/or for duping strings. If we don't free those allocations, they'll leak. By using an ArenaAllocator, it's much easier (and faster) to manage the memory as a whole.

The dangling pointer happened because the JSON input had a shorter lifetime than our config, resulting in config referencing no-longer valid memory. Our memory leak is kind of the opposite: by returning only result.value, we've lost any reference to the arena and thus can never free those allocations.

One solution is to change parseConfig so that it returns Parsed(Config). This allows our caller to deinit the arena:

pub fn main() !void {
  var gpa = std.heap.GeneralPurposeAllocator(.{}){};
  defer _ = gpa.deinit();
  const allocator = gpa.allocator();

  const parsed = try parseConfig(allocator, "config.json");
  defer parsed.deinit();

  std.debug.print("db path: {s}\n", .{parsed.value.db_path});
}

fn parseConfig(allocator: Allocator, path: []const u8) !std.json.Parsed(Config) {
  const data = try std.fs.cwd().readFileAlloc(allocator, path, 4096);
  defer allocator.free(data);

  return std.json.parseFromSlice(Config, allocator, data, .{
      .allocate = .alloc_always,
  });
}

You can't call parsed.deinit() inside of parseConfig and then return parsed.value. Then we'd re-introduce a different dangling pointer where the config references memory allocated by the arena which has since been freed. The arena and value are tightly linked, they exist as a single unit.

Personally, I don't like the name std.json.Parsed(T). There's nothing JSON-specific about this type. It's just a value of type T and an ArenaAllocator. I would prefer if the type was something like std.Owned(T). But whatever its name, the value and arena are one and share a lifetime.

In addition to parseFromSlice there's also a parseFromSliceLeaky. The "leaky" version returns T directly. This version is written assuming that the provided allocator is able to free all allocations, without having to track every individual allocations. It essentially assumes that the provided allocator is something like an ArenaAllocator or a FixedBufferAllocator. In practical terms, parseFromSlice internally creates an ArenaAllocator and returns that allocator with the parsed value whereas parseFromSliceLeaky takes an ArenaAllocator (or something like it) and returns the parsed value. In both cases, you end up with a value of type T tied to an allocator.

Conclusion

The short version is that Zig's JSON parser has the ability to either reference the JSON input or make copies of the [string] values. The default is to reference the JSON input, which causes issues if the JSON input does not outlive the parsed value. The benefit is better performance (fewer allocations) for cases where the input does outlive the parsed value.

Furthermore, since parsing JSON can require allocations, parseFromSlice returns both the parsed value and an ArenaAllocator. To prevent memory leaks, deinit must be called on the Parse(T) returned by the function, at which point the value is no longer valid.

"But the grooms, or 'bettos,' as the Japanese call them, are not the only ones who indulge in tattooing. You will see many of the 'sendos,' or boat-coolies, thus marked, but in a less degree than the bettos. Perhaps it is because the grooms are obliged to run so much, and consequently wish to lay aside all garments. As they must wear something, they have their skins decorated in this way, and thus have a suit of clothing always about them. Joe Johnston's shattered army was at Jackson, about forty-five miles to northward; beleaguered Vicksburg was in the Northwest, a trifle farther away; Natchez lay southwest, still more distant; and nearly twice as far in the south was our heartbroken New Orleans. We had paused to recuperate our animals, and there was a rumor that we were to get new clothing. Anyhow we had rags with honor, and a right to make as much noise as we chose. "Nonsense, my dear fellow. Now let me open your eyes. Behold the great force of a man who is gifted with second sight. Where did you get those notes? Was it not on the same evening as the murder?" "Oh, nothing," came the reply. "Only I was so silly as to place the wrong end of my cigarette in my mouth and burnt my lips. What's tuberose?" Reviewing these mechanical conditions, we may at once see sufficient reasons for the platen movement of planing machines; and that it would be objectionable, if not impossible, to add a traversing or cutting action to tools already supported through the medium of eight joints. To traverse for cutting would require a moving gib joint in place of the bolted one, between the standards and main frame, leading to a complication of joints and movements quite impracticable. And disadvantaged aye begins the strife. We have already seen how this fundamental division is applied to the universe as a whole. But our philosopher is not content with classifying the phenomena as he finds360 them; he attempts to demonstrate the necessity of their dual existence; and in so doing is guilty of something very like a vicious circle. For, after proving from the terrestrial movements that there must be an eternal movement to keep them going, he now assumes the revolving aether, and argues that there must be a motionless solid centre for it to revolve round, although a geometrical axis would have served the purpose equally well. By a still more palpable fallacy, he proceeds to show that a body whose tendency is towards the centre, must, in the nature of things, be opposed by another body whose tendency is towards the circumference. In order to fill up the interval created by this opposition, two intermediate bodies are required, and thus we get the four elementsearth, water, air, and fire. These, again, are resolved into the antithetical couples, dry and wet, hot and cold, the possible combinations of which, by twos, give us the four elements once more. Earth is dry and cold, water cold and wet, air wet and hot, fire hot and dry; each adjacent pair having a quality in common, and each element being characterized by the excess of a particular quality; earth is especially dry, water cold, air wet, and fire hot. The common centre of each antithesis is what Aristotle calls the First Matter, the mere abstract unformed possibility of existence. This matter always combines two qualities, and has the power of oscillating from one quality to another, but it cannot, as a rule, simultaneously exchange both for their opposites. Earth may pass into water, exchanging dry for wet, but not so readily into air, which would necessitate a double exchange at the same moment. 192 He helped her out. "I have drifted in a way," he went on to explain. "I left home when I was a mere boy, and the spirit of savagery and unrest laid hold of me. I can't break away. And I'm not even sure that I want to. You, I dare say, can't understand." Yet he felt so sure, for some reason, that she could that he[Pg 71] merely nodded his head when she said briefly, "I can." "Then, too," he went on, "there is something in the Indian character that strikes a responsive chord in me. I come of lawless stock myself. I was born in Sidney." Then he stopped short. What business was it of hers where he had been born? He had never seen fit to speak of it before. Nevertheless he intended that she should understand now. So he made it quite plain. "Sidney was a convict settlement, you know," he said deliberately, "and marriages were promiscuous. My grandfather was an officer who was best away from England. My grandmother poisoned her first husband. That is on my mother's side. On my father's side it was about as mixed." He leaned back, crossing his booted legs and running his fingers into his cartridge belt. His manner asked with a certain defiance, what she was going to do about it, or to think. "Naw; git out. Don't bother me with no questions, I tell you," impatiently said a man in citizen's clothes, who with arms outspread was signalling the switching engines. "'Tain't my business to give information to people. Got all I kin do to furnish brains for them bull-headed engineers. Go to that Quartermaster you see over there in uniform. The Government pays him for knowin' things. It don't me." As little Pete dropped to the ground, his nervous finger touched the trigger and his gun went off up in the air. The others took this as a cue, and banged away as rapidly as they could get their muskets off. "I am no master," Cadnan said wearily. "I am a slave." "The people know," Dodd said. "It's out. It's all out. About the slavery. Is that what you mean?" Cadnan peered at him, half-fearfully. "You are a master." One did not give orders to masters, or argue with them. "Why not? There's naun shameful in it. Munds's brother did it for twenty years. And think of the difference it'll m?ake to usthirty pound or so a year, instead of the dead loss of Harry's keep and the wages of an extra man beside. I tell you, mother, I wur fair sick about the farm till I thought of this." "Ben, I swear I'm your true wife." Reuben started, and Pete awoke noisily. Harry was frightened and dropped his string, crying because he could not find it. The knock came again, and this time Pete crossed the room yawning, and opened the door. simeitelight.com.cn
king365.com.cn
chuandazhi.com.cn
gryd.com.cn
cqcq10.com.cn
www.prcpe.net.cn
dxdj093.com.cn
hls123.com.cn
kinye.com.cn
www.lnxld.com.cn
中韩欧美一级一中文字暮 _黄片BB亚洲AV无码天堂www亚洲国产韩国欧美在线不卡一级 _毛片机地男男性行为免费视频播放九九欧美一级毛欧美片 _啪拍看片久色综合免费福利视频玖玖60岁欧美老妇一级毛 中韩欧美一级一中文字暮 中文字幕第十九页 中央游月中文字幕 中文字幕 第9页 中文字幕午夜福利 黄片BB亚洲AV无码天堂www亚洲国产韩国欧美在线不卡一级 中文字幕欧美日韩 中村知惠中文字幕 啪啪啪在线视频 中文字幕亚洲综合 中文字幕先锋资源 中文字幕 青青草 中文字幕资源网站 中韩欧美一级一中文字暮 中文字幕卡通动漫 中文字幕理论电影 免费视频播放 中文字幕资源在线 毛片免费观看 黄色一级电影片 中国美女一级看片 中文字幕先锋影音 黄片BB亚洲AV无码天堂www亚洲国产韩国欧美在线不卡一级 欧美在线不卡 中文字幕巨乳有码 啪拍看片久色综合免费福利视频玖玖60岁欧美老妇一级毛 中文字幕第86页 中文字幕91在线 黄色一级录像片 中文字幕久荜在线 黄色一级欧美片 毛片视频在线 中文字幕 第7页 中文字幕每日更新 毛片免费试看 中文字幕巨乱亚洲 中韩欧美一级一中文字暮 啪拍看片久色综合免费福利视频玖玖60岁欧美老妇一级毛 中韩欧美一级一中文字暮 毛片机地男男性行为免费视频播放九九欧美一级毛欧美片 黄色一级录像带 中文在线视频观看 中文字幕Av电影 啪啪啪男女视频 中文字幕 新妈妈 中国一级特黄大片 中文字幕久久视频 中文字幕黄色视频 中国黄色一级大片 中文字幕手机看片 中文字幕伦理在线 欧美一级毛 中韩欧美一级一中文字暮 免费视频播 啪啪免费视频网站 中文字幕男人天堂 黄色一级伦理片 中文字幕亚洲在线 中文字幕视频不卡 毛片免费网址 中国一级黄色大片 中文字幕亚洲情字 啪啪视频在线播放 啪拍看片久色综合免费福利视频玖玖60岁欧美老妇一级毛 中韩欧美一级一中文字暮 啪啪免费在线视频 亚洲AV无码天堂 中文字幕丝袜美腿 毛片免费视频 毛片在线电影 毛片在线不卡 啪啪啪在线播放 中文字幕伦理电影 中文字幕中文字幕 中文字幕一级在线 毛片免费基地 中文字幕综合影院 中文字幕手机在线 中文字幕视频在线 毛片机地男男性行为免费视频播放九九欧美一级毛欧美片 中国一级特大黄片 中文字幕乱码视频 中国成人在线视频 啪啪啪视频欧美 中国一级黄色电影 啪啪啪性爱动态图 啪啪啪欧美视频 黄色一级片播放 中文字幕乱码免费 黄片BB亚洲AV无码天堂www亚洲国产韩国欧美在线不卡一级 黄色一级毛毛片 中文娱乐在线视频 黄色一级黄色片 中文字幕乱伦电影 黄色一级片aa 中文字幕第十七页 男男性行为免费视频播放 黄色一级片观看 毛片免费电影
кŷһһĺ _ƬBBAVwww޹ŷ߲һ _ëƬΪƵžžŷһëŷƬ _žĿƬɫۺѸƵ60ŷϸһë кŷһһĺ Ļʮҳ Ļ Ļ 9ҳ Ļҹ ƬBBAVwww޹ŷ߲һ Ļŷպ д֪Ļ žžžƵ Ļۺ ĻȷԴ Ļ ĻԴվ кŷһһĺ Ļͨ Ļ۵Ӱ Ƶ ĻԴ ëƬѹۿ ɫһӰƬ йŮһƬ ĻȷӰ ƬBBAVwww޹ŷ߲һ ŷ߲ Ļ žĿƬɫۺѸƵ60ŷϸһë Ļ86ҳ Ļ91 ɫһ¼Ƭ Ļ ɫһŷƬ ëƬƵ Ļ 7ҳ Ļÿո ëƬԿ Ļ кŷһһĺ žĿƬɫۺѸƵ60ŷϸһë кŷһһĺ ëƬΪƵžžŷһëŷƬ ɫһ¼ Ƶۿ ĻAvӰ žžžŮƵ Ļ йһػƴƬ ĻþƵ ĻɫƵ йɫһƬ ĻֻƬ Ļ ŷһë кŷһһĺ Ƶ žžƵվ Ļ ɫһƬ Ļ ĻƵ ëƬַ йһɫƬ Ļ žžƵ߲ žĿƬɫۺѸƵ60ŷϸһë кŷһһĺ žžƵ AV Ļ˿ ëƬƵ ëƬߵӰ ëƬ߲ žžž߲ ĻӰ ĻĻ Ļһ ëƬѻ ĻۺӰԺ Ļֻ ĻƵ ëƬΪƵžžŷһëŷƬ йһشƬ ĻƵ йƵ žžžƵŷ йһɫӰ žžž԰̬ͼ žžžŷƵ ɫһƬ Ļ ƬBBAVwww޹ŷ߲һ ɫһëëƬ Ƶ ɫһɫƬ Ļ׵Ӱ ɫһƬaa Ļʮҳ ΪƵ ɫһƬۿ ëƬѵӰ
ENTER NUMBET 0018